1 CATTGGAGAC CATGGATAAG TACGATGTGA TTAAGGCCAT CGGGCAAGGT 
51 GCCTTCGGGA AAGCATACTT AGCTAAAGGG AAATCAGATA GCAAGCACTG 
101 TGTCATAAAA GAGATCAATT TTGAAAAGAT GCCCATACAA GAAAAAGAAG 
151 CTTCAAAGAA AGAAGTGATT CTTCTGGAAA AGATGAAACA TCCCAACATT 
201 GTAGCCTTCT TCAATTCATT TCAAGAGAAT GGCAGGCTGT TTATTGTAAT 
251 GGAATATTGT GATGGAGGGG ATCTCATGAA AAGGATCAAT AGACAACGGG 
301 GTGTGTTATT TAGTGAAGAT CAGATCCTCG GTTGGTTTGT ACAGATTTCT 
351 CTAGGACTAA AACATATTCA TGACAGGAAG ATATTACACA GGGACATAAA 
401 AGCTCAGAAC ATTTTTCTTA GCAAGAACGG AATGGTGGCA AAGCTTGGGG 
451 ACTTTGGTAT AGCAAGAGTC CTGAATAATT CCATGGAACT TGCTCGAACT 
501 TGTATTGGAA CACCTTACTA CCTGTCCCCA GAGATCTGTC AGAATAAACC 
551 CTACAACAAT AAAACGGATA TTTGGTCTCT TGGCTGTGTC TTATATGAGC 
601 TCTGCACACT TAAACATCCT TTTGAGGGTA ACAACTTACA GCAGCTGGTT 
651 CTGAAGATTT GTCAAGCACA TTTTGCCCCA ATATCTCCGG GGTTTTCTCG 
701 TGAGCTCCAT TCCTTGATAT CTCAGCTCTT TCAAGTATCT CCTCGAGACC 
751 GACCATCCAT AAATTCCATT TTGAAAAGGC CCTTTTTAGA GAATCTTATT 
801 CCCAAATATT TGACTCCTGA GGTCATTCAG GAAGAATTCA GTCACATGCT 
851 TATATGCAGA GCAGGAGCGC CAGCTTCTCG ACATGCTGGG AAGGTGGTCC 
901 AGAAGTGTAA AATACAAAAA GTGAGATTCC GGGGAAAGTG CCCACCAAGA 
951 TCAAGGATAT CTGTGCCAAT TAAAAGGAAT GCTATATTGC ATAGAAATGA 
1001 ATGGAGACCA CCAGCTGGAG CCCAGAAGGC CAGATCTATA AAAATGATAG 
1051 AAAGACCCAA AATTGCTGCT GTCTGTGGAC ATTATGATTA TTATTATGCT 
1101 CAACTTGATA TGCTGAGGAG GAGAGCCCAC AAACCAAGTT ATCACCCTAT 
1151 TCCTCAAGAA AATACTGGAG TTGAGGATTA CGGTCAGGAA ACGAGGCATG 
1201 GTCCATCCCC AAGTCAATGG CCTGCTGAGT ACCTTCAGAG AAAATTTGAA 
1251 GCTCAACAAT ATAAGTTGAA AGTGGAGAAG CAATTGGGTC TTCGTCCATC 
1301 TTCTGCCGAG CCAAATTACA ACCAGAGACA AGAGCTAAGA AGTAATGGAG 
1351 AAGAGCCTAG ATTCCAGGAG CTGCCATTTA GGAAAAACGA AATGAAGGAA 
1401 CAGGAATATT GGAAGCAGTT AGAGGAAATA CGCCAACAGT ACCTCAATGA 
1451 CATGAAAGAA ATTAGAAAGA AGATGGGGAG AGAACCAGAG GACATTGAAA 
1501 AAGACTTGAA ACAAATGAGG CTTCAGAACA CAAAGGAAAG TAAAAATCCA 
1551 GAACAGAAAT ATAAAGCTAA GAAGGGGGTA AAATTTGAAA TTAATTTAGA 
1601 CAAATGTATT TCTGATGAAA ACATCCTCCA AGAGGAAGAG GCAATGGATA 
1651 TACCAAATGA AACTTTGACC TTTGAGGATG GCATGAAGTT TAAGGAATAT 
1701 GAATGTGTAA AGGAGCATGG AGATTATACA GACAAAGCAT TTGAAAAACT 
1751 TCACTGCCCA GAAGCAGCAT TTACAGAACT GACTTGGCTC AGTTTCCTCT 
1801 TCCTGGAATA CTCTCTGCCT CATTTCCTTC TGGAAAAATC TCCATTCAGC 
1851 AGGCATCTTA TTGAGGATCT CCTTTGTGCC AACGACTGCT CACTGAAGGA 
1901 CTGGAGTGAG AAGGAAATGG AGCTTAGGAC ATAACCCTAC CACTACATAA 
1951 ACAAACTTTG GAGAATCAGG AGAGAGTAAA GCCAAAGGAG GAGAGACAGG 
2001 TCATGGGGAG GCACAGGAAT TGGCAGCATC AACTGGAAGA GAAAGGCCAG 
2051 ATGAGGGTTT TCCACGCAGA CTGTAGTTGC TGTGGGAAAC AGGAGGCAGT 
2101 GGGATGGAGG AGCGCCTCAG ACTCTGCTGC AGATGATGGC AGTGGCCGAC 
2151 ATCACCTCCA CCTGCCCCAC GGGGCCTGAC AGTGAGTCTG TGCTTAGTGT 
2201 CAGTCGTCAG GAAGGGAAGA CCAAGGACCC GTACAGCCCA GTGCTCATCC 
2251 TGATGTGATA GTCTACTTCT CACTATACAC CCTATAGATC TTGTATCAGA 
2301 CACTTTCAAA TATGTTGTTT TGATATCTCA AGAAAAAAAA AAAAAAAACA 
2351 CTGTCATGCC GTTACGAGCG (SEQ ID NO:1) 

FEATURES : 

5'UTR: 1-11 

Start Codon: 12 
Stop Codon: 1932 
3'UTR: 1935 
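
The start-codon coordinate listed above can be checked with a short translation sketch. This is an illustration only, not part of the original listing: the function name `translate_orf` is hypothetical, the standard genetic code is assumed, and only the first 50 nt of SEQ ID NO:1 are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_orf(cdna: str, start: int) -> str:
    """Translate from a 1-based start position until a stop codon or the end."""
    seq = "".join(cdna.split()).upper()  # drop the block spacing used in the listing
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for any non-ACGT codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 50 nt of SEQ ID NO:1, copied from the listing (start codon at position 12).
FIRST_LINE = "CATTGGAGAC CATGGATAAG TACGATGTGA TTAAGGCCAT CGGGCAAGGT"
print(translate_orf(FIRST_LINE, 12))  # MDKYDVIKAIGQG
```

The printed residues agree with the first 13 residues of SEQ ID NO:2. Passing the full cDNA with the same start position would stop at the first in-frame stop codon.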
Homologous proteins: 

Top 10 BLAST Hits 

                                                                      Score  E-value
CRA|18000004898313 /altid=gi|1709251 /def=sp|P51954|NEK1_MOUSE.        389   e-107
CRA|18000005213056 /altid=gi|9297022 /def=sp|Q9R0A5|NEK3_MOUSE.        311   2e-83
CRA|18000005207774 /altid=gi|6754820 /def=ref|NP_035978.1| NIMA.       311   3e-83
CRA|18000004929662 /altid=gi|1709253 /def=sp|P51956|NEK3_HUMAN.        285   2e-75
CRA|108000024647614 /altid=gi|12729925 /def=ref|XP_003216.2| se.       256   1e-66
CRA|18000005131165 /altid=gi|2570047 /def=emb|CAA70436.1| (Y092.       254   3e-66
CRA|18000005199482 /altid=gi|7522642 /def=pir| I 3C7122 serine (t.     254   3e-66
CRA|18000005207775 /altid=gi|6754822 /def=ref|NP_035979.1| NIMA.       254   3e-66
CRA|18000004925928 /altid=gi|4507277 /def=ref|NP_003148.1| seri.       252   1e-65
CRA|98000043621119 /altid=gi|12852471 /def=dbj|BAB29424.1| (AK0.       244   4e-63

BLAST dbEST hits: 

                                                                      Score  E-value
gi|12358928 /dataset=dbest /taxon=96...                                252   3e-64
gi|4310159 /dataset=dbest /taxon=9606 ...                              252   3e-64

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 

gi|12358928 Lung carcinoma 

gi|4310159 Lung carcinoma 

library source of cdna clone: 
placenta 

1 MDKYDVIKAI GQGAFGKAYL AKGKSDSKHC VIKEINFEKM PIQEKEASKK 
51 EVILLEKMKH PNIVAFFNSF QENGRLFIVM EYCDGGDLMK RINRQRGVLF 
101 SEDQILGWFV QISLGLKHIH DRKILHRDIK AQNIFLSKNG MVAKLGDFGI 
151 ARVLNNSMEL ARTCIGTPYY LSPEICQNKP YNNKTDIWSL GCVLYELCTL 
201 KHPFEGNNLQ QLVLKICQAH FAPISPGFSR ELHSLISQLF QVSPRDRPSI 
251 NSILKRPFLE NLIPKYLTPE VIQEEFSHML ICRAGAPASR HAGKVVQKCK 
301 IQKVRFRGKC PPRSRISVPI KRNAILHRNE WRPPAGAQKA RSIKMIERPK 
351 IAAVCGHYDY YYAQLDMLRR RAHKPSYHPI PQENTGVEDY GQETRHGPSP 
401 SQWPAEYLQR KFEAQQYKLK VEKQLGLRPS SAEPNYNQRQ ELRSNGEEPR 
451 FQELPFRKNE MKEQEYWKQL EEIRQQYLND MKEIRKKMGR EPEDIEKDLK 
501 QMRLQNTKES KNPEQKYKAK KGVKFEINLD KCISDENILQ EEEAMDIPNE 
551 TLTFEDGMKF KEYECVKEHG DYTDKAFEKL HCPEAAFTEL TWLSFLFLEY 
601 SLPHFLLEKS PFSRHLIEDL LCANDCSLKD WSEKEMELRT (SEQ ID NO: 2) 

FEATURES : 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 

N-glycosylation site 
Number of matches: 3 

1 155-158 NNSM 

2 183-186 NKTD 

3 549-552 NETL 

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 
Number of matches: 7 

1 48-50 SKK 

2 199-201 TLK 

3 243-245 SPR 

4 342-344 SIK 

5 573-575 TDK 

6 627-629 SLK 

7 632-634 SEK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 
Number of matches: 9 

1 48-51 SKKE 

2 69-72 SFQE 

3 243-246 SPRD 

4 385-388 TGVE 

5 430-433 SSAE 

6 444-447 SNGE 

7 553-556 TFED 

8 627-630 SLKD 

9 632-635 SEKE 



[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 
Number of matches: 2 

1 410-417 RKFEAQQY 

2 511-517 KNPEQKY 

[5] PDOC00008 PS00008 MYRISTYL 

N-myristoylation site 
Number of matches: 4 

1 97-102 GVLFSE 

2 285-290 GAPASR 

3 336-341 GAQKAR 

4 426-431 GLRPSS 



[6] PDOC00017 PS00017 ATP_GTP_A 
ATP/GTP-binding site motif A (P-loop) 
18-25 AYLAKGKS 

[7] PDOC00100 PS00107 PROTEIN_KINASE_ATP 
Protein kinases ATP-binding region signature 
10-33 IGQGAFGKAYLAKGKSDSKHCVIK 

[8] PDOC00100 PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active-site signature 

124-136 ILHRDIKAQNIFL 
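
The motif positions listed above can be reproduced with a regular-expression scan. The sketch below is illustrative only and checks just the N-glycosylation entry, using the PROSITE PS00001 pattern N-{P}-[ST]-{P}. The helper name `scan` is hypothetical. The sequence is SEQ ID NO:2 as listed, under the assumption that the OCR-damaged residues at 115, 165, 249, 295-296 and 632 are G, I, S, VV and S, which is what the cDNA translation gives.

```python
import re

# SEQ ID NO:2 (640 residues); block spacing is removed by split()/join().
PROTEIN = "".join("""
MDKYDVIKAI GQGAFGKAYL AKGKSDSKHC VIKEINFEKM PIQEKEASKK
EVILLEKMKH PNIVAFFNSF QENGRLFIVM EYCDGGDLMK RINRQRGVLF
SEDQILGWFV QISLGLKHIH DRKILHRDIK AQNIFLSKNG MVAKLGDFGI
ARVLNNSMEL ARTCIGTPYY LSPEICQNKP YNNKTDIWSL GCVLYELCTL
KHPFEGNNLQ QLVLKICQAH FAPISPGFSR ELHSLISQLF QVSPRDRPSI
NSILKRPFLE NLIPKYLTPE VIQEEFSHML ICRAGAPASR HAGKVVQKCK
IQKVRFRGKC PPRSRISVPI KRNAILHRNE WRPPAGAQKA RSIKMIERPK
IAAVCGHYDY YYAQLDMLRR RAHKPSYHPI PQENTGVEDY GQETRHGPSP
SQWPAEYLQR KFEAQQYKLK VEKQLGLRPS SAEPNYNQRQ ELRSNGEEPR
FQELPFRKNE MKEQEYWKQL EEIRQQYLND MKEIRKKMGR EPEDIEKDLK
QMRLQNTKES KNPEQKYKAK KGVKFEINLD KCISDENILQ EEEAMDIPNE
TLTFEDGMKF KEYECVKEHG DYTDKAFEKL HCPEAAFTEL TWLSFLFLEY
SLPHFLLEKS PFSRHLIEDL LCANDCSLKD WSEKEMELRT
""".split())

# PS00001: N-{P}-[ST]-{P}. The lookahead lets overlapping sites be reported.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")


def scan(pattern, seq):
    """Return (start, end, motif) tuples using 1-based inclusive coordinates."""
    return [
        (m.start() + 1, m.start() + len(m.group(1)), m.group(1))
        for m in pattern.finditer(seq)
    ]


for start, end, motif in scan(N_GLYCOSYLATION, PROTEIN):
    print(f"{start}-{end} {motif}")
```

Run on this sequence, the scan reports 155-158 NNSM, 183-186 NKTD and 549-552 NETL, the same three sites given under entry [1]. Other short PROSITE patterns could be scanned the same way by swapping the regular expression.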



Membrane spanning structure and domains: 
Helix Begin End Score Certainty 
1 587 607 0.898 Putative 



BLAST Alignment to Top Hit: 

>CRA|18000004898313 /altid=gi|1709251 /def=sp|P51954|NEK1_MOUSE 
SERINE/THREONINE-PROTEIN KINASE NEK1 (NIMA-RELATED 
PROTEIN KINASE 1) /org=NIMA-RELATED PROTEIN KINASE 1 

/dataset=nraa /length=774 
Length = 774 

Score = 389 bits (988), Expect = e-107 

Identities = 239/631 (37%), Positives = 350/631 (54%), Gaps = 74/631 (11%) 
Frame = +1 

Query: 1 mdkydvikaigqgafgkaylakgksdskhcvikeinfekmpiqekeaskkevillekmkh 180 

M+KY ++ IG+G+FGKA L K D +H VIKEIN +M +E++ S++EV +L MKH 

Sbjct: 1 MEKYVRLQKIGEGSFGKAVLVKSTEDGRHYVIKEINISRMSDKERQESRREVAVLANMKH 60 

Query: 181 PNIVAFFNSFQENGRLFIVMEYCDGGDLMKRINRQRGVLFSEDQILGWFVQISLGLKHIH 360 

PNIV + SF+ENG L+IVM+YC+GGDL KRIN Q+G LF EDQIL WFVQI L LKH+H 
Sbjct: 61 PNIVQYKESFEENGSLYIVMDYCEGGDLFKRINAQKGALFQEDQILDWFVQICLALKHVH 120 

Query: 361 DRKILHRDIKAQNIFLSKNGMVAKLGDFGIARVLNNSMELARTCIGTPYYLSPEICQNKP 540 

DRKILHRDIK+QNIFL+K+G V +LGDFGIARVLN+++ELARTCIGTPYYLSPEIC+NKP 
Sbjct: 121 DRKILHRDIKSQNIFLTKDGTV-QLGDFGIARVLNSTVELARTCIGTPYYLSPEICENKP 179 

Query: 541 YNNKTDIWSLGCVLYELCTLKHPFEGNNLQQLVLKICQAHFAPISPGFSRELHSLISQLF 720 

YNNK+DIW+LGCVLYELCTLKH FE N++ LVLKI F P+SP +S +L SL+SQLF 
Sbjct: 180 YNNKSDIWALGCVLYELCTLKHAFEAGNMKNLVLKIISGSFPPVSPHYSYDLRSLLSQLF 239 

Query: 721 QVSPRDRPSINSILKRPFLENLIPKYLTPEVIQEEFSHMLICRAG 855 

+ +PRDRPS+NSIL++ F+ I K+L+P++I EEF + + G 
Sbjct: 240 KRNPRDRPSVNSILEKGFIAKRIEKFLSPQLIAEEFCLKTLSKFGPQPLPGKRPASGQGV 299 

Query: 856 APASRHAGKVVQKCKIQKVRFRGKCPPRSRIS--VPIKRNAILHRNEWRP 999 

PA+++ + K K K PP+ + + +P+K+ ++ E R 
Sbjct: 300 SSFVPAQKITKPAAKYGVPLTYKKYGDKKLLEKKPPPKHKQAHQI PVKK MNSGEERK 356 

Query: 1000 PAG - -AQKARSIKMI ERPKIAAVCGHYDYYYAQLDMLRR RAHKPSYHPIP 1143 

A K R ++ IE+ K + + + ++R RA + + + 

Sbjct: 357 KMSEEAAKKRRLEFIEKEKKQ KDQIRFLKAEQMKRQEKQRLERINRAREQGWRNVL 412 

Query: 1144 QENTGVE DYGQETRHGPSPSQWPAEYLQRKFEAQQYKLKVEKQLGLRPSSAEPNYN 1311 

+ E +G PSP +Y + Y ++ LR E + 

Sbjct: 413 RAGGSGEVKASFFGIGGAVSPSPCSPRGQY EH YHAI FDQMQRL RAEDNEARWK 465 

Query: 1312 QRQELRSNGEEP-RFQELPFRKNEMKEQE YWKQLEEIRQQYLN 1437 

+RQ+ E + +E RK E + + Y +L +IR Q N 

Sbjct: 466 GGIYGRWLPERQKGHLAVERANQVEEFLQRKREAMQNKARAEGHWYLARLRQIRLQNFN 525 



Query: 1438 DMKEIRKKMGREPEDI EKDLKQM R LQNTK ES KN P EQ K YKAKKGVK F E 1578 

+ ++I+ K+ E ++ E D++ ++++ K N K + ++ K 

Sbjct: 526 ERQQIKAKLRGENKEADGTKGQEATEETDMRLKKMESLKAQTNARAAVLKEQLERKRKEA 585 

Query: 1579 INLDKCISDENILQEEEAMDIPNETLTFEDG 1671 

+K + +E+++ ++ D+P E G 

Sbjct: 586 YEREKKVWEEHLVARVKSSDVPLPLELLETG 616 (SEQ ID NO:4) 



HMM results: 

Scores for sequence family classification (score includes all domains): 

Model     Description                                     Score    E-value  N
PF00069   Eukaryotic protein kinase domain                299.8    3.4e-86  1
CE00022   CE00022 MAGUK_subfamily_d                        28.3      3e-08  1
CE00031   CE00031 VEGFR                                    21.6    1.3e-06  1
CE00359   CE00359 bone_morphogenetic_protein_receptor       8.8      0.081  1
CE00203   CE00203 ERBB_RECEPTOR                             2.2        2.9  1
CE00220   CE00220 ACTIVIN_RECEPTOR                          2.0        5.7  1
CE00287   CE00287 PTK_Eph_orphan_receptor                  -2.0    4.3e-08  1
CE00292   CE00292 PTK_membrane_span                        -9.2    1.2e-08  1
CE00291   CE00291 PTK_fgf_receptor                        -50.8    8.8e-06  1
CE00289   CE00289 PTK_PDGF_receptor                       -64.6       0.17  1
CE00290   CE00290 PTK_Trk_family                          -80.7      6e-10  1
CE00286   CE00286 PTK_EGF_receptor                        -84.2    5.8e-06  1
CE00288   CE00288 PTK_Insulin_receptor                   -175.5    2.9e-05  1
CE00016   CE00016 GSK_glycogen_synthase_kinase           -218.2    0.00014  1

Parsed for domains: 

Model     Domain  seq-f  seq-t     hmm-f  hmm-t      score   E-value
CE00289   1/1         3    103 ..      1    109      -64.6      0.17
CE00220   1/1       124    141 ..    340    357        2.0       5.7
CE00203   1/1       106    154 ..    843    890 ..     2.2       2.9
CE00359   1/1       124    175       272    326 ..     8.8     0.081
CE00022   1/1       120    195 ..    138    217 ..    28.3     3e-08
CE00031   1/1       111    200 ..   1054   1144 ..    21.6   1.3e-06
CE00286   1/1         7    236 ..      1    263      -84.2   5.8e-06
CE00288   1/1         6    253 ..      1    269     -175.5   2.9e-05
CE00287   1/1         4    254 ..      1    260       -2.0   4.3e-08
CE00291   1/1         8    257 ..      1    285      -50.8   8.8e-06
CE00292   1/1         6    257 ..      1    288       -9.2   1.2e-08
CE00290   1/1         7    257 ..      1    282      -80.7     6e-10
PF00069   1/1         4    259 ..      1    278      299.8   3.4e-86
CE00016   1/1         1    324 [.      1    433     -218.2   0.00014

1 CTTGGCAGGC CGCCGCTGTG GCCCAAAGAG TAGGAAGCCG TTCCAGTCTC 
51 ACGTCCACCT TTTGGCAATA TTTGAGACCT TGTACAAGAA ACACTCTTCC 
101 TGTATCAGTT TAGCTCATTT GTAAAACTGG GAGACTACTG CCTTGACGGG 
151 TTGTAAAGAA AAGAGAGAAC GTTTGCGAAG CGTCTGGTGC ACCTTAAGCA 
201 AGAGCGGGGA GCGCTACTGT AGACTGCAAA GCAAAGGAAT CCCGACCCAA 
251 GGCAACGGGA CGGTTGCGGG GTGACTCTGC CGGGTCTCCA AACTCCCTGG 
301 CGCCTGACCC TGCCTCGAGG TGGACTGGTC CCCAGGCCAT TCCAGACCCG 
351 CGCCCCGCCC GCGTTTCCTT GCGCGGCTCC GCCCCGGCCG CAGGGAGGCG 
401 CAGCGGCCCC GGGAACCCGG ATCCTTCCGG GACGCTTCGT TGGCCCCGCG 
451 GAGCCGGCGG AGCAGGTACG CTTGCAGGGG CCGCCCTTAG TTCTTGCCCG 
501 GAGCCGCCAC AGGGCTTCGG GAGCTCGGCA GGGTGGGGGA AAGGGATGGA 
551 GTTTCGGCCT GGGGCGGCGG GGGCGGCCCA GAAAAGGCCT AGCGTCCTGG 
601 GCTGTGTGGG TGTAGCGTCC AGGGCGCGTC GGTCTCTATG GCAACGCTCC 
651 ACACGCGGAG GTCGGGTACG GGTAAGCGTC TTGCCACTCA CCCGCGGCCG 
701 CTTCCAGGGG CGGCCCTAGG GGAGAAGGAA TTTTCCTAAT TTGGGGGCTT 
751 CCACCCTTTG GTGCCACTTG GGCGGGAGGG TCGCGGGCCC TCAGTTCCCG 
801 GCGAGTCACC CCCGGCCCCA AGTCCGTATG CGTCTCTCTC AGAACCCGAT 
851 CCTCCGGTGT CTGCAGCCTC TCCTGGCTGC GGAGCTGGTT CCCAGCCCCC 
901 TGCAACCCAG TACCGACTTC CCACCCTGAC GTAAAATTAT TCGAAAACAA 
951 GCCCCCTGCT CACCCCATTA ACAACAACAA CAAAACTGTA TTATGCCCTA 
1001 ACTGTAGCAT AAAGAGGAAA TAGAAGGAAA GCAATAAGTA AGAAAGTACA 
1051 TATTTCAATC TGAAAATGCT TGGCACTACT ACCCTTGGAA AATGTAGAGA 
1101 AGTAGCCAGT AGCCGCGCCT GGGGAGTCGC CTGAACGTGA CGGCAGCAAA 
1151 TGCAGATTGT TGGGTCTCCG GGACCAGGAG CAGCGTGGCC AGTGAAGCGC 
1201 GTGGTTTTCC CAAATGGTGA ACAATTCTTG GTAAACCTCC AAACCGAAGT 
1251 GCAATCAAGC CTTGATTTAC ATGTAGTTGC ATTCCTCGAA AAAAAAAAGA 
1301 AGTGTTCATT AAAACTGCAA AAATACTTAG CATTTCGATG TAAAATAGAG 
1351 TTTGGTTCTA GCCACAAACG GATTTTTCCA CGCACAGGAA TGTATAGGAA 
1401 GACTCTCCAA GATTGTAGGG CCGCGGGGTA ATCCTTTATT GTGCGGGACT 
1451 GTCTCTCGAA TCGCAGAATC CTACCATCTC AGGCCCCAAC CACCTGTAAA 
1501 CCTCATGCCT CTGAATCTTG GGGAAACAGC TTCCCCACCC CCATATATTT 
1551 CCAGAATTCC CCCTAGGGGG CAGTACGTCC CCACTAAGAA AGGCTGAACT 
1601 ATAAAAGTGC ACAAGCCTAA GGACATTCCT GCTTTATAAA GGTGCGAAAC 
1651 ACCGGATATA GTATCTTTCA TTCTCAGAAC AAACTTGCAA AACAGGTATT 
1701 GTTATTCCAT TTTAGAAATT AGGAAAGTGA GGTTTTGCCA GGTTAAGTGA 
1751 CTTACCCGAG AATACAGGGC AAAAGTGTAT CAAAGCTGAG CTATGACCCG 
1801 TGTCTGACCA AGAAACTCTG TCTCATTTCA GTTATCTGTG GCCACAAAGA 
1851 AAGTTATTTG TCTCTGTCTT GGCAAGGCTG GGAGGAAAGT TTTAGCTAAG 
1901 TGAGTTCTTT TACACTTTAG TCATCAGTTT TCTGACTTTG TTAGTCTTTA 
1951 TGAGACGTGT GTGATAAATT TACATTACTC TAATTCCAGG AAACTCAGCC 
2001 CATTGGAGAC CATGGATAAG TACGATGTGA TTAAGGCCAT CGGGCAAGGT 
2051 GCCTTCGGGA AAGCATACTT AGCTAAAGGG AAATCAGATA GCAAGCACTG 
2101 TGTCATAAAA GAGATCAATT TTGAAAAGGT AAAGTTAAGT TCAAATTTCT 
2151 GTTAATTTTC AGTGGGATAT TCAGCTGGCT TTTAATCCAA TATAAAAAGG 
2201 AAATTTTTAT TTTTTATAAT TTCGAATTTT AAGCCATAAT TGATTTTTGT 
2251 TAATTCAACC TCCTAAGTCC ATTGTCCAAA CAGCAACCAA TGATCTCATT 
2301 TTTAAAAAGA GGCTGGACGC ACTGGCTCAC CCCTGTAATC CCAGCACTTT 
2351 GGATGGCCAA GGTGGGAGGA TTGTGTGAAG CCAGGAGTTC GAGATAGCCT 
2401 GGGCAACATA GCAAGACCCT GTCCCTGCTA AAAAAAAATT TTTTTAATGA 
2451 AAATAGAAAA GAAATAAGAT CACATCCCTG TGGCTCCTAT GGCCCTCCTT 
2501 AGGGTGCCCT GCAAGGCCCT GTGAGATGCC AGCCTCCTCT GTTGCCCTGA 
2551 CTTTTCTCTG TGGTGCACTT CCTCTCTCCT TATTCAGGTC CTCTACGAGG 
2601 GGTTTTCTGC AAACATCCTA GCTAGAGTAG ACCCCCAGCC ACAATCACAC 
2651 CTTATCACCT TATCACACCA CCTTGGTTCC TGGTTTCTTT TTTGTTTTCT 
2701 TTTCTTTTCT TTTTTTAGAC GGAGTCTCGC TCTGTCACCT AGGCTGGAGT 
2751 GCAGTGGCAT GATCTTGGCT CACTGCAATC TCCACCTCCG GGATTCAAGC 
2801 AATTCTCCCA CTTCAGCCTC CTGAATAGCT GGGACTACAG GTGCATGCCA 
2851 CCATGCCTGG ATAATTTTTT GTATTTTTAG TAGAGATGGA GTTTCACCAT 
2901 GTTGCCCAGG CTGGTCTTGA ACTCCTGAGC TCAAGTGATC TGCCCGCCTT 
2951 GGCCTCCCAA AGTGCTGGGA TTACAGGCTT GAGCCACTGC TTCTGGCCTG 
3001 GTTTATTTTC TTACTAGCAT GTATAATGCT CTGCAATTAC TTTGCTCTCT 
3051 TAATTATTCA TTTGTTTATT GCTTGTCTTC CTCAGTATGC AGAACAGTTC 
3101 CTGTCACATA ATAGGTGCTA AACACATTTA TTGAGTGCAC TGAATGAATA 
3151 GAGAAAAACT ATATGTAATT GTTGGTCTAA TGATTTTGGA AAATAAATAT 
3201 AGTTAATTAA AAATTAATAA TTTTTGCTAA ATCCACCTTG GTCAGTGTTT 
3251 ATGTCACCCT CTTTAGTGAT ATGTTCATTT CATAATATAT TGGGACAACA 
3301 ATGTCCATTG TTTGCTAGAA TTAATTCTAA GGCAAGTCTT GTTGGTCAGC 
3351 TTCTAGAGGA TTTATAAATG AGAGTAGCAT AAAAAGTTCC ATACAAAGTG 
3401 TGTGCAAAAT GGACTACCCA AGTTACACCA TATGAATATA CTTAATGCCA 
3451 TTGAACTGTA CACTTAAAAA TCGTTAAAAT GATATAAATT TTATCTTACC 
3501 ACAAAAAATT GCAAGAAAAC CTACCCAAAC TTAAAGCTCA AGAGTAGATG 
3551 ACTGGCTTCC AGGGATAATG ATTTATTTCC CAATATAGGT CTCTTTTTGT 
3601 GAATCCATGG CATATTCATA ATAATGTCCT CTTATTCTAG TGGCCCGCAA 
3651 TAGCTTCCTC CCATGACATT ATTCTGCTCA CTCTCTTTTG TTTATCTGAC 
3701 TGCTCTCCCT CAGGCTTATC TCTGTCTTCG CCCTGTGTAT GTCCTCAACC 
3751 ATGTGTCCTT TTCTGATTTT CTTTTTCTGT CCATTGTCAC CTAAACTGCC 
3801 CCACTTCAGT GTTTACCAAT AAGTAGATCT CTCTTAAATC TCTGTCTCTA 
3851 CCCCTGGCAT CTTTCAGTAC CCTAGTTCTG CATTTCTTCT GCCAGCTAGA 
3901 TAACTTCAGG TAATATCTGT GGTTTTGTTT TGAGGTGGAG TCTCGCTCTG 
3951 TCGCCCAGGC TGGAGTGCAG TGGTGCCATC TCGGCTCACT GCAAGCTCTG 
4001 CCTCCCAGGT TCATGCCATT CTCCTGCCTC AGCCTCCCGA GTAGCTGGGA 
4051 CTACAGGCGC CCGCCACCAC GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA 
4101 GACGGGGTTT CACTGTGTTA GCTAGGATGG TCTCAATCTC CTGACTTCGT 
4151 GATCCACCCG CCTCGGCCTC CCAAAGTGCT GGGATTACAG GCATGAGCCA 
4201 CCACACCCAT CCAACATCTA TGTTATTAAT CTATTGCTGT GTAGCATATT 
4251 ACCCCAAACT TAGTGGCTTA AAGAATAAAC ATTTATTGTC TCAGAGATCC 
4301 TGTGCATCAA GAATTTAGGA TGACGATCAT TGAGACCACC TTGGAGGCTC 
4351 GGTATCACAA TTGTACCCAA AAACAAGTAT TAATAGTGAT TCTTCCTTGT 
4401 TGTAAGCAGA CCCACTTCAC CTCCTATGTG CTGCGCTGTA TTAATGTCAT 
4451 CAGTGTCCTT ATGGTTGCCA GCCTGAAAAC CTTGGGATCG TTTGTGAGCT 
4501 TATTCCTTCT CCACATTCAA TTATTTGGCG AATACTGTTG ACTCTTCCTC 
4551 TTCCTTGAAT TTGCTTCAGT CCTTTTGTCG AGGCCCTGGG TCACTTGGAT 
4601 CCTTCAAGTG GCTCCAGCCC AATTTTGATA ATGCTCCAGC CATGCCCCCA 
4651 AACCTTCACT GGGACAGAGG CTGTAAAGAA AGAGTTGCCT AGGTTTGACT 
4701 ACATAAAAAT AGAAAACGTT TGTATGTCAA AACAAACACT ATAAATAAAT 
4751 TCAAAGAAAT CGAGAAGGTG CCAAAAATAT TTGCAAGTAT TGACTTAATG 
4801 GTGTTAGCCT TTTATTAAAT CAATAAAAAG ATAAAATCCA TATATGAAGT 
4851 CATCGTACAA AAATTTGAAA CTCAGTAGAA AACTAAGAAA TTAGGAGTTT 
4901 ATTCAAAGAA AAACCCCACA GATAAACAGT TAGAAAACAA ATGTCCAACA 
4951 GTAGGTAATT TGTTAAGTAA TTTATAAAAA ACTAAGTGGC TATTAGCAAT 
5001 CATGTTGTAG GTGAAGCATT GACATGGGAA AATTTCAATG TTTGCAATGT 
5051 TTGAGAAAAT AGTAAGTGTA AAATAATATA ATCTTTGGAA AAATATATAT 
5101 ATTCTCCATA TATATGTATA CCTACAAATA TGTTCATATA TGTACAAAGA 
5151 AAGACACAAA TTGTTATTAT TGAGGTAGAA AGTGGGGTTT GCCTTGTGCA 
5201 TTTTTTTTGA GACAGGATTT CATTCTGTTG CCCAGGCAGG AGTGCAGTGA 
5251 CATCATCATG GCTCACTGTA ACCTTGAACA AGCCATCCTC CTGCCTCAGC 
5301 CTCCTGAGTA GCTAGGACTG CAGGCATGCG CCACCACACC CAGCTAATTT 
5351 TGAAATTGTT TTTAGAGACA GGATCTTGCT ATGTTGCCCA GGCTGGCCTC 
5401 AAGTGATCCT CCCACTTTGG CCTCCCAAAG TGCTGGGATT ACAGGTGTTA 
5451 GCCACTGTGC CTGGTCTGCC TTGTGCTTTT ATATTGTTTC ACTCTTCAGA 
5501 GAAGTTTTGA GACCCTCTCT GATTTGCTCC AAAACTACAG CTCCTATCAC 
5551 ATACCCTACT TTTTTTCCCC ACTCCAGCCT CTGCATTTGC TTCTGGGGCT 
5601 ACTTCTTCCA AGGTCGTTGC CTGCTGATCT CCCAGCATCA AGATCCCACT 
5651 TGTTCAAGGC TGAGCTCTAC CATACCTCCA GAATCCTCCC ACTCTAAAGA 
5701 ATTTATCCTT CTCTGTAAAC TTGCATAACT TTTATTGGAA CCTCTGTTAT 
5751 AGTACTGACT GCTTTCTTTC TGGACATGCT TTGGCTGTTT ATTTTGTGCC 



FIGURE 3B 



5801 TTCTCCTCCT TATTTAGCTG TAATATGTTC TGTGAGGACC GAGTCCATGT 
5851 GTGTTTTGTG CTGGTATTCC ACACAGCACC TAATGCTTGG TGCCAGGAGA 
5901 TATTCAATAA CTTCTTATTG GATAGATGAT TCACTGGACA GATGCTTTCA 
5951 GGCCCTCTTG CTCTACTGTG AAGCTGGTAT ATACTTAGGA ATTATAAAAC 
6001 CATTTTAATT CTATGTAAAG AGAAAATATT TGAGAGGTGA ATCTCTATAA 
6051 AAATGTACAT TAACATTACT GCATTTCATA GCATCTCTCC CATTCTTTAG 
6101 TATAATCAAA AATTGACTAT ATTTTTCTAA TAGAGCACCA ATTTTTCATC 
6151 ACTTTACTCA TGAACTACTC TTGTCACTAT GCCATAAATA AGTAGAATCT 
6201 TATATTAGAC CTCATTATTC TTGTTTTCCC ATATCTGTTT ATGTTATCGA 
6251 ATTTACCTAT AACATCTGTG TCACAATATT AACATTTATT ACTTCTTTCT 
6301 TCCTATCTAC TCTCATGTAG TTTTTCATTA CTTCTTATCT AGAGAAATTT 
6351 ATATTTCTTC TCTCTAATGC CTCCCTACTC CCTACACTAG ACCCCAGAAC 
6401 TAAATTGCTT GTTTTCTTAC AGGTACCAAA AAGCTAATAT TTCTCTTATC 
6451 ATCCTACCAT TATCAAGCAT GTTCTTTTCC TTCTGGGCTC AAATAAAAGT 
6501 GTTTTATCTT TCCTCAATTG TGAAAATAAA AATGTTCGTT GTAGAAATTT 
6551 TGAAAAGAGC CAAAGGAGAA AATAAGACCA TTTAGAGGAA AATAAAAATA 
6601 GCATATAACC TCTTTCTTAA TCACTATGAA CACTTTGCTG AATTTCTCTC 
6651 TAGACTATTT TTAATGTATA AGTATATAAG TTATTAGAAT GATTGGTGTC 
6701 ATGGTAGATA TACTCTTTTT TTTTTTTGAG ACGGAGTCTC GCTCTGTTGC 
6751 CCAGGCTGGA GTGCAGTGGC GTGATCTCTG CTCACTGCAA TCTCTGCCTC 
6801 CCAGGTTCAA ACAATTCTCC TGCCTCAGCC TCCTGAGTAG CTGGGACTAC 
6851 AGGCGCATGC CACCATGCCT GGCTAATTTT TGTATTTTTA GTAGAAACAG 
6901 GCTTTCACTA TGTTGGCCAG GCTGGTCTCG AACTACTGAC TTCGTGATCT 
6951 GCCTGCCTTG GCCTCCCAAA GTGCTGGGAT TACAGGTGTG AGCCACTGCG 
7001 CCCAGCCCAG GTATACTCTT TTGTAACAGT TTTTTATATT AGCAATATAT 
7051 TGTGAATATT TCCTCATCTC ATTCAATATT TTTATATAAT AAAATGTTGT 
7101 CATTTAATGA TATTAAATGT GTTCACACTA ATGATAAAGG GACCACCTGC 
7151 AGGGTGTCCA TTATATGTCA CACCATCCTG GGTGTTTTAT TATGTATATC 
7201 AACTCAATTT AATCTTCACA ACCACTTAAA AGGTAGCTCT CATTACTCTC 
7251 ACTGTACAAG TGAAAGAGCT GAGGCTAAAG AGGTTAAGCA GTTAGCTCCA 
7301 GGATGCACAG TAATCAGCAG ATCCATCTAA GTCTTTCTCT GCTCTTTCCA 
7351 TGATACTACA TTGCCTCCCT TTATTTTTAA TGACTGCATA GCATTAAAGT 
7401 GGTAGCAGGT CAAAAATACC ATAATTTAGC TGGGCATGGT GGCAAGTGCC 
7451 TGTAGTCCCA GCTATTCTGG AGGATGAGTT GGGAGGATCC CTTGACCCCA 
7501 GGAGTTAAAA TCCAGCTTAG ACAACATAGC AGAACTCTGT CTTAAAAAAA 
7551 AAAAAAAAGC TAGCAAAACA CCCCTGTAAT TTATTTAACT CTTTTTCTAT 
7601 TTTCAGATAA TTACATTGTT TGGTTGGTTT TTTGGCTACG ATTCAATAAC 
7651 ATTTAATATG TAAAGTATGA TTCATTTTTA TTAAACAAAA CTATGTATAT 
7701 ATGCTTGCCT ATATATGCAT GAAATAAAAA GCTCTAACTA TTAACAACAG 
7751 TTATCCCTAG GGAATATAGT ATTAGGTTGG CGCAAAAGTA ATTGCATTTT 
7801 TGCCATTAAG AGTAAGGTTA CCACCTATGG GCTTTCGTCT GTGGGCTAGA 
7851 TGAGAAAGAA AGAGGGAAGT TTCACTTTTA CCTTATTCAC TTCTATTTGA 
7901 CTTAAAACAA GCGTGCATTA TTAGAGTAAC TTAAAAACTA GCAATAAAAC 
7951 ACTGTAACAA AGTCTTTTGT ATGAGAACTC TTCTGTACCC TTTTATTATC 
8001 TTCTTTGGAT AAATTTCTAG AAGAATTAGT CAAAAATAGG AACATTTCCC 
8051 TCATGCCTGT AATCCCAGCA CTTTGGGAGG CTGAGGCAGC TAGATCACTT 
8101 GAGGCCAGGA GTTCGAGAGC AGCCTGGGCA ACATGGTGAG ACCCCATCTC 
8151 TACTAAAAAT ACAAAAAATT AGCCGGGTAT GATGGTGCTT GCCTGTGGTC 
8201 TCAGCTACTC AGGAGGCTGA GGTGGGAGGA TCACTTGAGC TCAGTGGGCA 
8251 GAGGCTGCAG TGAACCAAGA TCATGCCACT GCACTCCAGC CTGGGTGATA 
8301 GAGCAAGACC TTGTCTTAAA AAAAAAAAAA AAAAGATTTC TTCAGCAGGA 
8351 TACAGACCCC CCACAAAAAT GAACATTTTA AAGATTCATA TTATATATTG 
8401 TAAAACTGCC TTCCCAGAAA TATTTTATCA ATTTGTGTAG TTTTACCAGA 
8451 AATAAATGAG TGTCCATTTT GCTGCTTTCT GGCCAATAGT AGTTATTGAC 
8501 ATTCTTTTCA TCTTTGCCAG TTTCATACAT GGAATACTAT ATTACATTTT 
8551 GTTTTAGCTT TTATTCCTTT TTTTTTTTTT TTGCAATGGA GTCTTACTCT 
8601 GTCACCCAGG CTGGAGTGCA GTGGTGTGAT TTTGGCTTAC TGCAGCCTCC 
8651 ATCTCCCAGG TTCAAGGGAT TCTCCTGCCT CAGCCTCCTG AGTAGCTGAG 



FIGURE 3C 



8701 ACCACAGGTG TGTGCCACCA CGCCTGGCTA ATTTTTTGTG TTTTTAGTAG 
8751 AGACAGGGTT TTGCTATGTT GGCCAGGCTG GTCTTGAACT CCTGGCCTCA 
8801 AGTGATCTGC CTGCCTTGGC TTCCCAAAGA GCTGGGATTA CAGGCATGAG 
8851 CTACCACACC CAGCCAAATT TTGCTTTAGT TTTTATTCCT TTGATTACTG 
8901 CATGAGATTG AATATTTTTT CTATCAGCCA TTTTTATTTC TCTTTTTTTT 
8951 TCGAGTTGAC TATTCTTGTA CTTTGCTATT TTTCTGTTGG GGTGTTTGCC 
9001 TTTTTAAAAA TTATTTGCCA TCAATTTTTA TATTATAAAT ATATTTGTCA 
9051 TATATGGTAC AAATATTGTA TCTTATCCTT TTGTTTGTCT TTTAATTTTG 
9101 TTTATAATAT TCTTTTAAAT AAATAGTAGT TAGGAATTTT TTAAGTTGCT 
9151 AAATGTATCC AGCTGGTAGG AGTAATTTAG CTGTTTTTGT TTTGAAACTC 
9201 CTATGTACTG ACTATACAAT TTAAATTGGG GCAGGAAACA CTGAAGCTTA 
9251 GAGGGGTTTA AGGAACTTAC TGAAGGATCC TTCAGCTGAG ATGTAGGGAA 
9301 GCTAGAATTG AGAATATTAA TTTTTAAGAA GTTCTTAAGT CTAAATGAGA 
9351 ATGAGAAATC TGGCCAATGT TGAAGACCTC TAATGGGTGG AGGCCCCGTG 
9401 GACATCAGAA AAGCGGGGCA GTCAGGGGCT GGAAGTCAGG GTAGAAATGA 
9451 CAAGTCAGCA AAGCATCAAG AGTGAGGAAG AAAAAGTAGA AATGAGGTGT 
9501 GGCCACTGGT ACTGGCACCA AACCCCTTGG CAAGTATTGT CTATAGGTGA 
9551 AAGTAGAACA AGAAAATACA CCCAAATACT TCTAAAATGA AGTCATGCAA 
9601 GACAATTTTT ATTTGAAAAT GAAAAATGTA GTCATCTTAA TACAAAATTT 
9651 TACTGACCTG ATTTCTGTGG GATATGACAC ATTTTCTTTT TTTAGATTTC 
9701 ATTTGTTTCT TCTCAGCAGT GATTGCTCCT GGAATGTTGC ATTTTTATAA 
9751 AGAATTCCTT CGCTACTGAA AGATAGATAT TAAAATATGG CTCCATATGG 
9801 CTAGATAATG AACACGGTAC CACCAGTCCA ACTTTTAATA TAGCAAAACT 
9851 TCACCAGAAA TATTTATTTT CTTGATGATG GTTGTCAACA AACCATTGAT 
9901 GAGATGTAGG GCACTCTGCT AATTCTAGAA ATGTTGTTTC CTGCCATTGA 
9951 AAGATCGTTT TCAAAGTGAC ATTAAAAGCC AGTGAAATCC TAGAGAATTT 
10001 TAGATGGAAA TGAGCAGAAA GCATGTTCTT GAAACCAAGT TAGCTTTATA 
10051 GACTACTCTG TCTCTTAATG TAATTTAGAT GCCCATACAA GAAAAAGAAG 
10101 CTTCAAAGAA AGAAGTGATT CTTCTGGAAA AGATGAAACA TCCCAACATT 
10151 GTAGCCTTCT TCAATTCATT TCAAGGTTTG ATTTTCTAAT ATTCGTTAAG 
10201 TATTTTTATA AAGTATAGGC ATGTTGACAT ATGTAAAAAG ATTTGTTCCT 
10251 AAGGACTGTG TATAAATTAA TTTTTGTAAA TGGGTCATTT CCCCATTTAC 
10301 TTAAATTGCA GCTTGAGACG TCCTCGTTAT TTCCTCTCTA GTAAGTTTTT 
10351 GTAGACGGCT TTCTTATGTT TTCTTGTTTT TCTGCCTCTC CTTAATTCTC 
10401 ACTCTCCCAA AAAATTAATG ACTGGCTTAT TAGCTTCTTT GCTGTAGTAA 
10451 CAACCCCCAA ATTTAAGTGA CTTACAATAA GACACATCTA TTTCTCACTT 
10501 ACATTACATG CTAGCTGTGG TGGGCTGGGG TCTTGAGTCT GGGGCCCAGG 
10551 CTGAAGGAGC AGCTCAGATA AGGAACCAGC TGTTCTCATA AGCAAGAGAA 
10601 GAGGGGAAAA CACAGAGCCC ACCACACTAT CGCTCTCAAA GCCCTGCTAG 
10651 GATGTGTGTG TTTGTGTGTG TGTGCTGGGG GGTACTCTGT TTACATGAGA 
10701 TCCTGCATAT CCTCAGGCAA CAGATGGGAC TGTGTAATCC TCTTACAGAG 
10751 AGCCAGCAAA CAGCCACGCA CCATAGCCTA GCACACTGCC ACGGAGAGGG 
10801 GGAGAACTTT AGGGAAGGAA GTACCTTCCT CTGTACACCT GAATACAATT 
10851 CTGCTGACAA CTTTAGGGAA GGAAGTCCCT TCCTCCATAC ATCTGAATAC 
10901 AATTCTGCCT CCACGCATCA CTGTAGTCCA AAGGTAAAAA ATAAATAATA 
10951 AATGAAGGAG CATTGGTCAG ACAGCATTCA TTCACTGAAC TGATACTTAT 
11001 TGAGTGCTTA CTCTATGCCA GGCATTGTTC TAGGTGTCAG GAATATAGCA 
11051 GTGAACAAAG CAGATGAAAA TCCCTGTTTT CATGAAATTT ATATTCTAGT 
11101 GGGAAGAGAT AGACAATAAA CAAATCTACA GTATGTCAGG TGTGTCTTAA 
11151 GTTGTGACAG GGCTGTATGT GCTGACAGTT TTATGAAGGG TCATTCCCCA 
11201 GCCCAGCCCC CAGCGCAGGG CTGTTTTAAG ACTGATAATT AGTTCATGGA 
11251 GCAGAAGTGT TAACCTCAAT ATCTTCAAGC ATCATCAGTT GGGTAAAAGT 
11301 CAGTCAATAA ATAAATACAG CCACTGTGTC TTGAGTATGT AAACTGTGCA 
11351 GAGCACTGTG TTCCTTACTG ATTAAAACCG CTACATTCAA GGTACTTCTG 
11401 TGTGTATGGC CCTTCTTTGG CTTCTGGGTA TTTAAAAAGA GCTCTTGGGA 
11451 CTCTTCTGAG GTCTTCCTGG GAGCAGAACA GTACACATGG TCTGGAATTG 
11501 GGTTGCATGG AATAACTTTC AAGGAAAGCC ACTGAATAAA GTGCCCTGCA 
11551 TTCCTGTCCA TTGGATACTG ATAATGCTAT AAGATGATCT TTCTCTTCTT 



FIGURE 3D 



11601 TATTTTGTTT GAGATTATTG TGACTCTCTG GCTAACTCCT ACTTATCCTC 
11651 AGGCCTTTTC TGAACTCACA ATTCAAATTA CAGCTCCCTT TGGTTCTCTT 
11701 CCACAGCAGT TGTACTTACA TATGTCTATT TATATAATTA TGAATTTGTT 
11751 TCATATTTGT CGCCCTTTAC ATGGTAAACT TAATGAATTT TGGGGCTCCA 
11801 TCTGTTTTGC TCACCACTTG ATCCTTGGCA TGTAGCACAC AATGGCTGCT 
11851 CAATACCTAT TTACTGAATG AGCAAATGGA CTGGACCACT TTTAGAGACT 
11901 GGAGTATTTC CTTATACCAT GTGAGATTGA TTTTTGAGGA CAGTTTACCA 
11951 CTGGAAGCTT TTGCAGAACT AAGGTCATTT TTACAGTATA CATAACCTCT 
12001 GCTGTGTTTG TTGATACTGT AAGTTTACAT TTTCTTATGA CTCTTTTTAA 
12051 GTAGAGCACC CCTGTGTTTA GGAAAGCTAG AGCTATTGTG ATGCCTTTGA 
12101 GTTTGCTTGG CTGATTGCTG GGACTTGAAC TACTGAGCTT ATCTAAAAGC 
12151 CTCAGAGGCC TTGTAGCCTC TGTTTTTTAG AGAGTGTAGG TAAAGGCTTG 
12201 TTTTCCCTCA AATCGCTTAT CTCTGATCAT AAGAACCATG GCTCTAATGT 
12251 TTGTCTATAG AAAATAGAAT GTTTTGGCCG GGCGCAGTGG CTCATGCCTG 
12301 TAATCCCAGC ACCCTGGGAG GCCGAGGCGG GCAGATCACC TGAGGTCAGG 
12351 AGTTCAAGAC CAGCCTGGCC ATGGTGAAAC CCCGTCTCTA CTAAAAATAC 
12401 AAAAACGTTT AGCCGGGCAT GGTGGTGTGC ACCTGTAATC CCAGCTACTT 
12451 GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC CTGGGAGGCA GAGGTTGCAG 
12501 TGAGCTGAGA TTGCGCCACT GCACTCCAGC CTGGGCAAGA AGAGTGAAAC 
12551 TCTGTCTCAA AAAAAAAAAA AGAAAATAGG ATGTTTTTAT TGGTTTGAAG 
12601 CAACATAAGA AAAATAATGA GAATGTAGTG ATATTTTCCT AAGACAAAAT 
12651 TAATTCCATG TATATTCCAT CAATAAACAT TCACTAAGTG TCTGTTATAT 
12701 GCCAGGCATG TTCTAGGTCT TGGAGATATA TCAGCAAACA AAATAGGCAA 
12751 AAATTCCCAT GCTGTTGTAT TTGTTTTCTA TTACTACATA ACAAATGAAC 
12801 ACAAATTTAG TGGCTTAACA ACAACACCTA TTTATTATCT CTTGATTTCT 
12851 GTAGGTCAGA AGCCTGAGGT TGGCTTAGCT GGATTCTCTG CCCGGAGTCT 
12901 CAGCTAGTTG AAATCAAGGT GTCAGCTGGG ACTGTTATCT GTGGCTCATG 
12951 GTCCTCTTCT AAGCTTATTT AGGTTGTTAT AGATTTCATT TACTTGCAAT 
13001 TGGGTTAATT GGATCATGGC TCACTGCAGC CTTGAACTCC TGGCCTCAAG 
13051 TGATCCTCTC GCCATGGCCT CCAAAAGTGC TGTGAGTACT GTGCCTGGCC 
13101 AGAAAGAGCT CTTTTACATT TATTTAAACA CAGAGTTTTA TTTTATATTA 
13151 CTCTAATGCA CACATAAAAA AGAAAATATA AGCAAACAAA GTTGGTTAAG 
13201 GTATTCTAAA AATTATTTAG GCAGTGAAAA CATTAAGCCT GCCGGGTCTA 
13251 CAGCAAGTGA TTGGAAGATG CCAATGTCTG TAAGAAACAA TCTTGATTTT 
13301 TTTTTTTTTT TTTTTTGAGA CAGTCTTACT CTGTTGCCCA GGCTGGAGTG 
13351 CAGTGGTGTG ATTACAGCTC ACTGCAGCCT TGACCTTCTG GGCTTAAGGG 
13401 ATCCTCCCAC CTCAGTCTCC TGAGTACTGG GACTACAGGC ATGTACCCCC 
13451 ACACCTGGCT AATTTTTGAA TATTTTTTCA TTATAGAGCC AGGTTTTCGC 
13501 CATGTTGCCC AGGCTGGTCT CAAACACCTA AGTTCAAGCA ATCCACCTGC 
13551 CTTAGCCTTG GCCTCCCAAA GTGCTGGGAT TACAGGAGAG AGCTGCTGCG 
13601 CCAGGCCCTT GATTTTTTAA AAGTGCATTT TAGAATGAAT TATAATAATT 
13651 GTTTAATAAA TGTTGGAATT TGACAAATAA AAAGGTTATT TAGTGCCCCT 
13701 CAATTGTTTT GAAGTGTCAG TGATCCATGA GCTTTACAGC AGATGGAAAA 
13751 TTTGAGAGCA TAAATGATTT TTCCAGACAC TTCCAATAAA TATAAAATTA 
13801 ACAGTGGCTA ATGGGGGAAA ATCCTTATTT TACAGTCAGA TAATGCTAAT 
13851 TGACATTAAG TAGTTTCTTT TTTTTTTTTT TTTTTTTTTT TTTGACGGAG 
13901 TCTCACTCTG TCACCCAGGC TGGAGTGCAG TGGCACAATC TTGGCTCACT 
13951 GCACCTCCAC CTCCCGGGTT CAAGCAATTC TCCTGCCTCA GCCTCCTGAG 
14001 TAGCTGGGAT TACAGGCGCC CGCCACCATG CCTGNNNNNN NNNNNNNNNN 
14051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNTAATAA TCAGGGATTT 
14101 AGAAATAGGA TGAAATGACA AGCTAAAGTC TTTGAGTCTT AGTTCCTCAG 
14151 CTGTAAAGTG GGCATGGTAC CATTTCACAG GAATTGATAC GAGGATTAGA 
14201 AGAGATTAGG GCTGTGAAGT GCCTGACACA CAGATAGTTC TTGAATCAAA 
14251 TGAGGGTAAA TTGTAGCACT CATCTGTCTC ATAAGCCTAA ACTGCACATG 
14301 TATCTATATA TTCATGACAA AAATCATTCC AACACCCAGG GGCTGTCCAA 
14351 AATACATCTC TATCTCTCTG TATACCTATT CCCTCTCCAC CCCCCAAATC 
14401 CTATACAAAT TCCTCTTCCT CGGCCGGGCA CGGTGGCTCA CGCTTGTAAT 
14451 CCCAGAACTT TGGGAGGCCA AGGCGGGTGG ATCACGAGGT CAGGAGATCG 



FIGURE 3E 



14501 AGACCACAGT GAAACCCTGT 
14551 CATGGTGGCG GGCGCCTGTA 
14601 AGAATGGCGT GAACCCAGGA 
14651 CACTGCACTC CAGCCCAGGT 
14701 AAAAAAGGTA TCTCTGTCAT 
14751 AAGCCAGTGT AAGGTTGAGT 
14801 TGAAATAATG TGTCTCCTCT 
14851 GAAATTGCCA TCCCTTCATT 
14901 CAATGACAAG AAACATCAAT 
14951 ACCGATCTCT AGACATTAAG 
15001 TCCTAGATCT CCAGTCTT AC 
15051 TACAGACTTC TTCACAGCAC 
15101 TAGACTTCGC CTCCTTAACA 
15151 CCTACACAGC ATTCCTTGCT 
15201 AACTGAATGG ATCGTCTCTT 
15251 TGAAAGAAGA TGCATGGTTT 
15301 GCCAGGTTAC TGATATCCAC 
15351 AGGATAGCAA AAGATAGATA 
15401 ACTACAGGGG AAAATGTTAT 
15451 CTTATG I I I I GTGTTCCTTC 
15501 ATCCACTCAA CAAACATTTA 
15551 GTACAAAAGA ATGAAGTGTA 
15601 CTAACAAGAT CCCAGCTGTT 
15651 GAGCAGAAAA TCATCTGTTT 
15701 CATAAACTAG GTGGCTTATA 
15751 GAGGCTGGGA AGTGCAAGAT 
15801 GGGTTCACTT TCTGGTTCAT 
15851 GTGGAAGGGG AGAGGGTTCT 
15901 CCACCCTCAT GACCTAGTCA 
15951 ATCACTGTGG GGGTTAAAAT 
16001 CATTTGGATC CTAGCAATTA 
16051 ATCATCCTTT GAGTGGAAAT 
16101 AAGGAAATGA CAAGCATTTG 
16151 AGCAAGTGTT TGTGGATAAT 
16201 TTATATTTCA TTCATTTCAT 
16251 TTTATTTTTA AACTCACTGA 
16301 AGGAATAATC ACATCTTTCA 
16351 ATAAAGACAA GAATAACTCA 
16401 TGGTAGATTA TGAGTCCACT 
16451 AAGTTATGTG TCTT ATTTAG 
16501 CAAGGATTTG TGGATAGTCT 
16551 ATATTTCATT CATCCCATCA 
16601 AACTACTCAC TTTTACTGAG 
16651 AAATAATCAC CTTTCATTTT 
16701 AGACAAGAAT AAATGGCGTA 
16751 GGTAAGATTC AGTTCAATTG 
16801 GAGCCTAGTA GTATTTAGTG 
16851 GTGTCCTGTA ATCTAGTGAG 
16901 TCTATTGTAT TCAACAACAT 
16951 TATGCCCAAA TTCAAGAAGG 
17001 AGAGTCACAG CTGCTATTTT 
17051 GTATTAATAA AAATAGCCAA 
17101 TTATTCTAAG CATCTGACAT 
17151 CAAAGAAGAA ATTTCTATAA 
17201 CACGGAGAAG TTAAAGTGAC 
17251 AATGGTATAG TGGAATTAGA 
17301 CATGGCTGAT AAATATCTGA 
17351 TTACAGAGAA TGGCAGGCTG 



CTGTACTAAA AATACAAAAA ATTAGCCGGG 
GTCCCAGCTA CTCGGAGAGG CTGAGGCAGG 
GGCGGAGCTT GCAGTGAGCC GAGATTGCGC 
GACAGAGCGA GATTCCGTCT CAAAAAAAAA 
GATGAATTTT TAGTCAGTTT CTTTCACCAA 
GAGTTGGAGG TTGGTGAAAA GGAAGTGAGA 
GTTTCATTAT TCTGATTCTT CCATGGTTTT 
GCTGTGGCAG ACC I I I I ACT GACTGAGCTT 
TCTCCTAAAA GTAACACATT CCTGAAAATA 
GCATATGGGG AAGCATCTTC TCAGAACCCT 
TTGCATGCGC TTTCTCCTCA CCCTCCAGCC 
TTCTCACCCA ATGCTGAGGG ACTCCCCAAC 
GTAGAATGGA TGGCTTCTTT GCI I I I I ATT 
TTTGCATGTC CTCAATAGAA GTTTGTTAAT 
AAAGAAGAGA GGAGGAAAAA TTGAAATATG 
GTGAATTAGA AGCAACCAAG GGTAGACACT 
AGTAAAGTTG GTTAGGGTAC TTTAAAGAGT 
TTTGGCAAGA GATTTTGGCA TTTAATGGGT 
CAACAATTGC TTATAAGACT GATTTTGGCG 
AGGGTTTTTT TGTTTGTTTG TTTTTAATGA
AGCCCCTTTG ATGTGCTAAC TACTGTTTAG 
GACAAACAAG TGAGTAGAAA TCCTTCTTTT 
AGTTGGTTGG CTAATGAAGA AAGCTGGTTA 
TAGTCTATTC CAGCAGCTAT AACAAAATAC
AACAGCAGAA ATTTATTTCT GGCAGTTCTG 
CAAGGTGCAG GCAGATTCAG TGTCTGGTGA 
AGATGGTCCG CCTCATTGTG ACTTCACATG 
CTCTTGGACA GCAATCCATT AATGTGGGCT 
CCTTCCAAAA GCCCTATACC TCCTAATACC 
TTCAACATAT ACATTTGGGG AAGACAAAGA 
TACAGACATA TTTTAACATA AGAAGACATA 
GGCCAGGAAA AAAAAAAAGA AAAAAAATTT 
TTAAAGGATA ATTTCTTTTC TTTAATACGG
CTGTCCACAA TCCTTTTAGA AGTTTTCTAG 
TCAACATTTA GGTCAATGGT TATTTATTTA 
GTCCTCCAAA ATATTCAGCA TAGCTTTTGG 
TTTTCTATTC ATATTTCATC AGTTTATGTA 
CTACAGTTCA AGAAAATTCA GAATTATAGT 
GACTATAGTT CTGAATTTTC TTTCTTATGT 
AATTTCTAGT CTCTTTTCTT TAATGTGGAG
GTACATAATC CTTTTAGAAG TTTTCCAGTT
ACATTTAGGT CAATAGCTAT TTTTTTTAAA
TCCTCCAAAA TATTCAGCAT AGCTTTTGGA 
CTATTCATAT TTCATCAGTT TATGTTACAA 
AACATATTTG GGAAAAAACA AAATGATCTT 
GTAAGAGCAG AGGTACTTGG ACATACTAGA 
GTAACGTTGA TGGGGCAATA GCAATGGAGA 
TGGTTTAGGC AGAGGTCAGT TATAAGAGTT 
AGATAAATAA GTGCCATATA AATATGGCTT 
GGCTATTAAT TCTTCCTGGG GTGTTATGAA 
TACAGCAGAT TTCTAACCTC TCAAAGGAAT 
CATTTCTTAA ACTCTCATTA TGTACAGGCA 
GGATTAACTC ATTTAATCTT TCCAACAATC 
TATTTCCATT TTACAGATGA GGTAATGAGG 
TTGCCCAGAG TCACAGAGCT AAGAAATATA 
ACCCAGTCCA TCTTGATTCC ACAGCCAGAA 
AATCCTTCCA GCACCAAATG ATTCCTTTTC 
TTTATTGTAA TGGAATATTG TGATGGAGGG 



FIGURE 3F 



17401 GATCTCATGA AAAGGATCAA TAGACAACGG GGTGTGTTAT TTAGTGAAGA 
17451 TCAGGTAAAA ACTTCTAATT TGCTTTTTAA TTTTATGTAT GTGTGGTGGG 
17501 GGGAGAATTA AATTTATAAT GTCATAAGTA GTAGACTATG TTGATATAAA 
17551 GCATGTGTTT TGGTAGACAG ATTGAAACCA TAAAATAGTG GATCAATATT 
17601 CTTGGTAGAT TCAGTCCAGA ACAAGTTTGC AATTGAACTT AAACTGAATT
17651 GTTTTTTAGA GAGTGAGATT TTCTTGAGAA AAGATAATCT GTTTGGAAAA
17701 TCTCATGTAG GATGCCTCTG AGATAAGTCT TCATGGTTAA AAAAAAATCT 
17751 GAATGTGAGT GTTCCTTAGC CATTTAACAT GTAACATATT TATAGCTTCA
17801 CTGTTTTCTC TTTTACCATT TGGTGCTCTG TTTTAAACTA GATATCCCAC 
17851 TTTGCTAGGG AGGATAGAAG CTTGTCTTGG CAATGCCTAT TTAGTTTCAT 
17901 TGGTTATTAA GAGGGGGAGA TAAAAGATGA AGAATAATGG CCTCTCCCAC 
17951 TTTCTCTTTG CACAAATGTA TTTCTCTTCT ATACTCCAAG CCTCCCTGGA 
18001 ACTCTCTTGG GAGTGGTACT CATGAAGGAG ACAGGTTTGT ATGTGTGGAG 
18051 AGGAATCTGA ATTAGCTCCT TTATGATTGA TGGTAAGGCC ATTGCCTCAA 
18101 GAAGCACACA GGAAAGGCCC ACCATCTTTC CTTTGGCCAT TGTTTCTTTG
18151 TTCTATTTTA GCATGTAAGA GCATCATGCC CATATAATTT CCTCTTAAAG 
18201 TTGAATATTT TGAATATATG AAGGATTAAA AAATCAATAT CTCTAACTTC
18251 TGTAAGATTA ATCAAGCATT CTTTGTATGC TCATTTATAT TATATATTAA 
18301 ATTCCATATT GATAGAAACT CTTTTTTCTT ATCTAGGTAT TATTTTGCCA
18351 CATTTTATAA AAATGCTTCT CATGACAAAA TTTTGAGTTA CATTTCTTTT 
18401 TGTTGGGAAT GAACTAAAAT TACAACTGAA TATTAATGTC TGGAATATAG 
18451 CTTTATTCCT ATTATTTTCC TCTTTCTGTA TAATTTGGCA GACAAAGAAC 
18501 CAGTGAAATT TTAGAATAGT TTAAATAAAT CTCTGTAGGC ATAGAACACA 
18551 TTTTCATAAA GAGGCTCATG GTCAACAAAG ATAAAATCAA ATCATGACTT 
18601 AGAAATAAAA CTAAACTTCA AAGGTAAAGT ATTTGTTGGT TTTATATTAG 
18651 ATATACTGAT ATTTTATTAC AATTCCTAAC CTCACAGATC CCCCATTTCT 
18701 TCCTCTTTCT CTCCCCACCC TTGTCACCCT CCTTCCACTG TAAAGGAAGA 
18751 ACCAATGGCT CCCAGGTTAT CAGGAAACAG GGCTGCTTGT GTACTATTCA 
18801 CGATGCAGTT AGCACCCCAG GGTTAAGTAG GAAAAAAAGA AAAACATGAA 
18851 CGGCATGCCT CTTTCCCTTG CTTCTACTTA TCTTTTTCTG CATGTGGAAT
18901 TTCCCTTGAT TTTACCAGTG ATATTTGGAT TACTTTTCTG TGCCTCCATT 
18951 TTTTTAGTTG TAGAATGAAA ATAATAATAT GATAAAGTGT ACCTATTAAC
19001 TTCATTCCTA TAAATACACA TACACTATAT GTGTGTATGT GTGTGTGTGT 
19051 GTGTGTATAA TTTCTATCTT TTTGCATGTT ACCATGAAGA CATTTCAGTG 
19101 ACTACCAGGC TATTCAGTGG CTTTGTTTTG TGTTCTCTCT ATAGATCCTC 
19151 GGTTGGTTTG TACAGATTTC TCTAGGACTA AAACATATTC ATGACAGGAA 
19201 GATATTACAC AGGGACATAA AAGCTCAGGT AACAGCTCAG AGAGAAGACT 
19251 AAGACAGAAC TGATCTTTTC TTGAAGTACC TCAAACAACA TGACATTTTC 
19301 TCCATTTATA GAACATTTTT CTTAGCAAGA ACGGAATGGT GGCAAAGCTT
19351 GGGGACTTTG GTATAGCAAG AGTCCTGAAT AAGTAAGTAC TTTGAAAATA 
19401 AM I I ICTTT CTAGTCAAAA TAGCCCAAAT ATGTA Mill AGATATCATG 
19451 GATTAAGAAG ATATTAAAAT CTTGGTTGTC TAAATAATTT TAGGTAGCTT 
19501 TATGTAAATG CATTACATCA GATGGTACTT TGAGATTAAA ATTCTCAAGA 
19551 TAAATTGTGG TGTAATAGAA TGATGTTGCT AATATTCTGT AGTGTGATTC 
19601 CAGTTTGTCA AATATGGATG TGACTGTAAT ATGCATAAAG CTAGAGAGAA 
19651 TTTCGTGAAA TAGGCAGGTT TACACTTCTT AATGAAAAAA GTCAAACTCT 
19701 ATAAAATATT TGAAGAGATT TATTCTGAGC CAAATACGAG TGACCAAAGG 
19751 TCCATGCCTG TGACATAGCC CTCAGGAGAT CCTAAGAACA TGTACCCAAG
19801 GTGGCCGGTC TACAACCTGG TTTTGTACAT TTTAGGGAGA TGCAAGACAT 
19851 CAATTAGATG TACATGGGTT TGGTCCAGAA AAGCAGGACA ACTCAAAGCT 
19901 GGGAAGAATG GGAGGGAGCT TCCAGGTCAT AGGTGGATTA AAAACTTTTC 
19951 TGATTGGCAA TTGATTGAAA GAGTCTATCT GAAGACCTGG AATTAGTGGA 
20001 AGGGAGTGTC TGGGTTAAGA TAAGGGGTTG TGGAAATGAA GGTTTTTATT
20051 ATGCAGATGA AATCTCCAAG TAGCAGGCCT CAGAGAGAAT AGATTGTAAA 
20101 TATTTCCTCT TATCGGATTT AAAAAGGTGC CAGACTCTTA GTTAACTTTT 
20151 TCCTGGATCA GGAAAAAGCC TTGGAAAAAG AAGGGAATTT TCTTCAGAAT 
20201 GTAGATTTTC CCCACAAGAG ATACCTTTGC AGGACTATTT CAAGATATGG 
20251 ACAAAGAAAC ATGATTTGGG GTAAAATATT TTGATTCCTT TCAGGCCTGC 



FIGURE 3G 



20301 TATCTGTCAT GTGATGTTAT ACTAGAGTCA GGCTGGACTT TGGTATCTTA
20351 TTGCTACAAG GAGTCTGCTT TGTCAGTCTT AAGGTCTGTT TTAATGTTAA
20401 TGCTGGTCAA CTGTGCCTGA ATTCCAAAGG GGAGGAGGAG TTAATGAGGC
20451 ATATCAGACC CTGCTTCCCA TCATGGCCTG AACTAGTTTT TCAGGTTAAC
20501 TTTGGAATGT CCTTGGCCAA AGGGAGGGTT TATGAGTTGG TTGGGGGGCT
20551 TAGAATTTTA Mill GGTTT ACACACTTTC TAGCAAAATA AATTTGTGCA
20601 CCTGTTTGGA AGACAATTTG GTGGCAATAT GTACCAAGAG ATTTTTAAAT
20651 ATCCTGTTTC TGGGACTTCT TCCAAGGGAA TAATTTGAAA TTTGGAATAA
20701 CGTAAATGCC TAAATAATTG GGAAATGGTT AAATTTAATA AAGCTTGGCA
20751 TGGCCATGGC CATGTACCTG AATATATCAT AAACATTTAT GGTTTTGAAG
20801 ACTTCTTGAT AACTTTGTTA TACTAAGCAA AGAAAATGGA ATTCTGAATT
20851 TTAAATACAT TGTGATCACG GTTATATGAA AAATATGTGT GGAAAGAAGA
20901 CAGGAAGGAA ATATATCAGA ATTTTAACAA TAGTTGTTTT AGGTGCTAAG
20951 ATTCTGGGTA ACTTTTTTCT CCCTTATTCA TTTTTGTATT TTCCAAGTTT
21001 TAAATCATGA GGTTGCAATT TGATAATCTC TACATCTGAG AGATTTTTAT
21051 AACATGACAA TTTCATCTCT TTGTGGAGTC TTTAAGCCAT AAAAAATATA
21101 TTTTAATGTG T AAA MING GGAGGTGAAT TGTAAGTTTA AAAATCAGCT
21151 GATTTAGTTA CTTTATCAAC ATACAGTGTT TTGCTTTCTT CTAACACATG
21201 TATGCATCAA ATCTTGTGTT ATCCATTTTC ACA MIMIC TTGCATGTCC
21251 ATGTCTTAAG ACTTTTCTTA CTCCAATAAA AAATCATGCT GATTTATTAT
21301 TTAATATAAT TTACTAGTTC CATGGAACTT GCTCGAACTT GTATTGGAAC
21351 ACCTTACTAC CTGTCCCCAG AGATCTGTCA GAATAAACCC TACAACAATA
21401 AAACGTAAGT TGCTGACTCT TAGTTTGAAA GTGTCAGTAA AATCTGATGG
21451 ATGACACTGA ATGAAGATTC CAGAAACTAA AATTCAAATC TCTTCTTTCT
21501 TTCTTATGGT ACTTTTGTAA TTTCATTTGC TTCATGTGTG AAATTGTTCT
21551 GGACCAAACT GAGGGTTGGG TTGCTATTTC TCGCGGTCCA ATACGAGATG
21601 CAGATGAACT GGGGAGGAAG AGAG MIMA TTTCTGTAAC CAGTACAGGG
21651 AGAAGGCCTG GAAATTATCA CCAGACCGAC TCAAAATTAC AAAGTTTTTC
21701 AGAGCTTATA CACCTTCTAA GCTATATGTC TATGTGTAAG TGTGCATTCA
21751 TTTAAAGACA TACTGATTAA CTCCTTTTAA TCTATAACTA AGGTCTGAGT
21801 CCTGAAGACT TTCTTCTGGA GCCTCAGTAA GCTTACTTAA TCTAAATGGG
21851 TCTAGGTCCT GGGGTGATTA CCCTTATTTT GTCTCCTGCT AAATCATGGA
21901 GGTTTAGGGA GTTCCTGCAG ACCTCCAATA AACTTGTTTG TGGAGGCCTG
21951 GGGAGTTTCT TCAGACCACC AATAAAACTT GTTTAATCTT AAAAGGCTCC
22001 TTGTTAAGAA TTCCTTCATT ATTTTGTCAT GGTTTAAGGC CCAGGAAAGG
22051 CCTAGGCAAA ACTCTTGGTG GGCTTTTGTT ACATTACAGC CTTTGTATAA
22101 GGGCACTGGC TTTTTTTTTT TTATTTTTTG AGATAGAGTC TTGCTCTTGT
22151 CACCCAGGCT AGAGTGCAAT GGCACGATCT CGGCTCACTG CAACCTCCAC
22201 CTCCCAGGTT CAAGCGATTC TCCTGCCTCA GCCTCCTGAG TATCTGGGAT
22251 TACAGGTGGC TGCCACCATG CCCAGCTAAA GTTTTGTGTT TTTAGTAGAG
22301 ATGGAGTTTC ACCATGTTGT CCAGGCTGGT CTCAAACTCC TGACCTCACG
22351 ATCTGCCTGC CTCAGCCTCC CAAAGTGTTG AGATTACAGG TGTGAGCCAC
22401 TGTGCCTGGC TGGGCACTGG CTTTTTAAGC TTTTAATATT TAACTTCACC
22451 ACTCAGTTAG TATAGAAACA GTTGTGATGG AGGCCTGCAT TGGTAAGACC
22501 TGGCCTGCCA CAAAATGGGG ATCCCAGTGA CTATCTCTGA GCAGTGTTAC
22551 CTGAAGGTTT CAAACTTGTT TAGAAGAAAG CCATTTCTCT TCATTTAAAG
22601 ATACAAGTGG TATAAAAAAT AACATCGAAA ATTGCAGTCA CTGTGATGTC
22651 CATTTTTGTA TTATATGTTC ATATCTTTGA AGCACTGTTT AGTCTATTGC
22701 AAGAAAGATT GAAGAGGATG AAGTAGAAGA CAATGTGGTC TGGTGACCGC
22751 TCACTGGATT AGGAGCTAGG AATCCTAGTC TTGGCTCAGT TGCTAACTTG
22801 ACCAAGTCAG TTGACCTCTG TGGGCTTCAG TTCCCTAACT CATAATAATG
22851 AGAGTATTGA CTAGGTAATC TTCAAGGTGT CTTCCAGCTT TAAAACCCAG
22901 TTAGTTTTTA TGTATGTGAT ATCAGAGTCT GGTTCTCAGC AATAA Mill
22951 TTTTTTTTTG AGATGGAGTC TGGCTCTGTC ATCCAGGCTG GAGTGCAGTG
23001 GTGTGATCTC GGCTCACTGC AATCTCTGCC TCCCAGGTTC AAGCAATTCT
23051 CGTGCCTCAG TCTCCCAAGT ACCTGGGACT GCAAGCACGC CCCACCATGC
23101 CCAGCTAATA TTTTGTATTT TTAGTAGAGA TGGGGTTTCA CCATGTTGGT
23151 CGGGCTGGTC TTGAATTCTT GACCTCAGGT GATCTGCCCG CCTCAGCCTC



FIGURE 3H 



23201 TCAAAGTGCT GGGATTGCAG GTGTGAGCCA CCGCACCTGG CCCTCAGCAG
23251 TAATGCTAAT GTATACTGCA AGAAAAGGTG AAGAGGAGCT TTTGCTTCCT
23301 ATAAGGAGAA GGAAAAAAAT TTCATTTTTC AAAGCTGGCT GCCATTGAAC
23351 AAGTTGGCGA TAAGGAAGAT TGAGTTCCCT TTGGAAGTTA ATTGTCCTTT
23401 TGTTTAGGAA AAAATGCCCA AGAGATACTT GGCTATTGGA CTTTGAAGGA
23451 GATAAATGGA AGGCAAAGCT CAGACAATAG AGATTTACAA AAAGAATAGT
23501 AAGAATTTCT CTGCATAATA AAATAACAGG GAM I I I I I I M i I M GAGA
23551 CATCCCCTGG CACCAAGGAG TTTGGCCTCA AGTTAGTTGT GCAGGAATTC
23601 AGGTAGGGTG TGTTGGACGG AAAGTAGGCT GTTCAGAGCA GGGCATGCCA
23651 CAGACAGCCT TGGGTCAGCT GCATTG MM GTTTGCTTGT ACTG C M I I C
23701 AAGAATTTGA ATCAACATTT AAATGCTGCT GGATATGGTG GCTCATGCCT
23751 ATAATCCCAG CACTTTGGGA GGCTGAGGGG GATGATTGCT TGAGTCCAGG
23801 GGTTCAAGAC CAACCTGGAC AACATTGTGA GACCCTGTCA CTACATCCAA
23851 AAAAAAATTA AATATTGAAA GACTTTAAAA TATGCATAGT TTGTACCTCT
23901 GAAAATTGGA AGATCTTAGC AATAATCAGG TGGGTAGCCG CTGGCTCCAT
23951 TAGAGGACTG GTTCACCACA GTCCTCAATA TGCAGAGTGG TCTCAGGCCT
24001 GCAACTGGCC CCACCCAACC CCCAGGTGGC TGCAGTACTG CCTGAGCCCT
24051 GGGGGCATAT GAATTCTCTG CCCTGGCTGC AGAGGGTCCT CTGGGAACAG
24101 AAGAGAAGTT TGGGTCTGTG GAAGCCCTAG TAAAGACAAA AGTCTGTGTG
24151 GTGTGAAATG GTCAGTGAGT TTCTAGAAGG TCTAGAAAGT TCATGTTTGT
24201 TTCCTGGGTC AGGTGCAGGC GGCTCACACC TGTAATCCCA GCACTTTGGG
24251 AGGCCAAGAA GGGAGTATTG CTTGAGCTCA AGAGTTTGAG ACCAGCCTGA
24301 GCAACATGGT GAAACCTTGT TAATGAAAAA AAAAATTATT AAAAAAAATC
24351 CCACAAATTT GTTTCCCACC AATCTTACCG TCTATTGTAC TTACTACCAT
24401 CTTTTGTACT CAAACTTTTA GTATGAGTCT ATCTCTCTCT CCTTCTCTCT
24451 GACACACACA CACACACACA CACACACACA CACACACTCA TGCACAAAGC
24501 ATTGCTGCTA GAGGAGCCAT TTACCTCACT CCTCACTTTA ATGATTCCTT
24551 CTTGCTTTGA CTCCTTGACT TCTGATTAGA CATTTTTTGA TCTTTTAGAT
24601 TTAATTGTGC TTTTTGTTCT ATAAAATAAC TCCTCAAACC AATCACATAT
24651 AAATATTTAT GAAGTACTAA ATCTGTAAGG AGCAAAGCTC ATGATATATA
24701 TTTTAAGTAT AM I I IAAAT GTTTATTGAG AATCAGATAC TATGTTTATC
24751 ACATAATATA ACTTTGGTTC TGTCAAAAGC CTTGAGTAGG ATATATCTTT
24801 CAAAATCAAC CAAATATTAC CTTTTGAGTC AAAACAAATC CATGTTTGAG
24851 TTCTGCCTGC CTCCTCCAAA TTGCTCAACA TTTCATCATA CATACATTGT
24901 TTTTGAGCAG GAAGCTGAAC TAAATATTAA GCCACCAGGT TGTAGCAAAG
24951 TTTGTGTGCC TTTCTTTGAC TAGAAATCTG ACAAACTACA AATGGTTTTC
25001 ATTTTACCTC TTATCTTCTA ATAAGAATTG ATGATATATC TGAAAGCATT
25051 TGTAAAAGCT GATCAACTTA CATAAAATTG TAAAGCGACA CAAATTTAAG
25101 GCACTGTAAG GATAAAAGCT TTTATTAAGA ATTATGGATA TTTTCTTGGC
25151 ATGTAAACTC TTATCTTCTT TAGGGATATT TGGTCTCTTG GCTGTGTCTT
25201 ATATGAGCTC TGCACACTTA AACATCCTGT AAGTATGCTC ATTGTCAGAC
25251 TAATCTTGAA TTATTGGAAT TGTAGAAAAG AAATTAACTT CTGGGAGAAA
25301 AAGGTTAATG TTTGGTTTTA TTAGATTGTT AAAAATTATA TGGATAAGCT
25351 ACTTAAAATA ATGATAGATG ACATGGAAAG CTGTCCAAGC AATATTATAA
25401 AGTAAAAAGT CCAAGTTGGA GAATAGTATG TGTAGCATAT TTCCATTAAA
25451 AATAAATTGT GTGGGCTTGG CGTGGTGGCT CATGCCTGTA ATCCCAGCAC
25501 TTTGGGAGGC TGAGGCGGGT GGATCACTTG AGGTCAGGAG TTGGAGACCA
25551 ACCTGGCCAA CATGATGGTG ACACCCCGTC TCTACTAAAA ATACAAAAAT
25601 TAGCCAGGCA TGGTGGCATG TGCCTGCAGT CCCAGCTAGT TGGGAGGCTG
25651 AGGCACGAGA ATTGCTAGAA CCCAGGAGGC AGAGGCTGCA GTCAGCTGAG
25701 ATTGCGCCAC TGCACTCCAG CCTGGGTGAC AGCGAGACTC CATCTAAAAA
25751 AAATAATTAA TTAATTAATT ACTGTATGAA TAGATACGTT CAGCAAAAGA
25801 AAAATGTACA TGGGCAAAGT TCATAGGAAA CCAGGCACAA GCTTTTAAGA
25851 GTCTTTTCCC AGAGGTCACA TGGGATGTGC CAAATCCTCC AGCATTGTTA
25901 CCCACGTCAC CTGTGAAATG TGATCTATAA GAAAGCTCAT CGGATATACC
25951 CAGTGCCCAG GATTTTTACT GGGGACTGGT CACATAGGCA CCCTCTACCT
26001 GGCATATGCC AAACTTCCAG ACTCCTGGAA AGAAAGCCCG TGTTCAGCAT
26051 AAACCATTTT GTTCACATAA ATAGCTGAGG CAAAGATAGC CACTCTTGAC



FIGURE 3I



26101 ATTCAGGGAA TGGTGGGAAT TCTTCTGAAA TCTTAGTTCC CAGACACCAG 
26151 CCACGGGCCA ACATTGTAAG CAGGCCTTTC TGAGGAGAGC TTGCTACATC 
26201 AACTCTTTTC TCCACAGCTG TCATCATTGT TATTAATTAT TGTCAAGGGT 
26251 TGCACAGCCA GTGTCTGACC AAAATGTGTA CTCCATTGTT M ill GAGAT 
26301 GGAGTCCCGC TCTGTTGCCC AGACTGGAGT GCGGTGGCAC GATCTCAGCT 
26351 CACTGCAACC TCTGACTCCT GGGTACAAGC AATTCTCTTG CCTCAGCCTC 
26401 CCGAGGAGCT GGGATTACAG GCACCCACCA CCACACCCGG CTAATTTTTT
26451 TGTATTTTTA GTAGAGTCAG GGTTTTGCCA TGTTGGCCAG GTTGGTCTTG
26501 AACTCCTGAC CTTGGGTGAT CTGCCCACCT TGGCCTCCCA GAGTGCTGGG 
26551 ATTACAGGCG TGAGCCACCA TGCCCGGCCA ATGTGTACCT TTATTGCTAC 
26601 ACCATGGAGT TGAATATTAT TATGTATAAA TAACTATTGG TTTCATACAA
26651 TAGAAGATTT CTGGTCTATG AAGCATTTTA GAGGAAATTA AACGATGTTT 
26701 ATGTTAATTT TAAAAAGCAA GAGATAAAAT TTCATATCAA TATGACCTCA 
26751 ACTTTGTAAA ATAAACATCA TTTTTAAAAG AGATCAGAAG GAGCTATACC
26801 TCTGAGTGGT AAAATTATAC ATATTTTCCC CTGTCTTTAT AACTTCCTAT
26851 ACCTTCCAGT TTTTTTATTA TGAGTAAACA TTATTTTGAT AATAAGACAG
26901 AATTAAAACA AAATAAAAAC TTGTTTTAAA TAACATGGCA TCTTGTTGAA 
26951 TAACTGCAGT ATCTGCTCAT GAAAGATTAG TTGATGAAAA CAATTTAAGG 
27001 TGGACCACAG TGCTTCTTTT TTATTTTTTG ATTGAGACAG GGTCTCACTC 
27051 TGTCACCCAG GCTGGAGTGC AGTGACGCAA TCACGGCTTA CTGCAGCTTT 
27101 GACCGCCTGG GCTTAGACAA TCCTCTTGCC TCAGCCTCCC AAGTAGCTGG 
27151 GACCACAGGC TCATGCCACC AAGCCCAGCA AATGTTTAAA AACCATGATT
27201 TGGAGAGATG AGGTCTAACT ATGTTTCCCA GGCTGGTCTT GAACTCCTGG 
27251 GCTCAAGTGA TCCTCCTGCC TTGGCCTCCC AAATTGCTGG GATTACAGGT 
27301 GACCCTAGTG CTTCTAACTA CAATTTAAAA ACATTGTTTT GCTTCTTGGT 
27351 ATATTTGTTA CTTTAACACT TTTATTATTT GTTACTTTAG TAACTTTTCT 
27401 CTGATTTAGT GTCATTTCTC CTTGTCCTTT CAGTTTGAGG GTAACAACTT 
27451 ACAGCAGCTG GTTCTGAAGA TTTGTCAAGC ACATTTTGCC CCAATATCTC
27501 CGGGGTTTTC TCGTGAGCTC CATTCCTTGA TATCTCAGCT CTTTCAAGTA 
27551 TCTCCTCGAG ACCGACCATC CATAAATTCG ATTTTGAAAA GGCCCTTTTT
27601 AGAGAATCTT ATTCCCAAAT ATTTGACTCC TGAGGTAAGT TTTGAGGTGA 
27651 CTGTTTGGAT TTTGGCAGAG ATTTTGGGTT GCAGGTCCTT GACACGTGTG 
27701 TTCGGTTTTA GGTCATTCAG GAAGAATTCA GTCACATGCT TATATGCAGA 
27751 GCAGGAGCGC CAGCTTCTCG ACATGCTGGG AAGGTGGTCC AGAGTAAGTG 
27801 TGACTTTGGC ATGCAATCAA AAGTATTTAT TACACATGTC TCACACAGAG 
27851 AGTAATGCAA GGAAATTTCA CCAAACATAT TGAAAGTGGA CATTTTAAAA 
27901 AATACAAGCA GTATAAGCAG GAGAAAAATC ATCTTGTCAA ATGGCAACTA 
27951 GTGAGTGTGC CTGAAAGTTG TATATCTAGC TCATGCATGA CCTGCAGGGT 
28001 TCCTTCTCGT TAGTCAGGAA ACCTCCATGA AGCAGAGGAC ATGCTAATAG 
28051 AGATGCTTGA AGAGGTTGAG CCCAAACTTA ACTTTTGTGT AGTGAAGGGA 
28101 CAGAGTGGGA GAAGGTTGCA GATAGACATG GATGATGAGA TGAAACTTAT 
28151 TTTTCTAAAA GAGGATAGAC TGGCAATTAA GAATTCTGTT GCAAAGGACC 
28201 ATTGGAGCTG AAGTTAGGAT CTTGGGGCCT AATTGATAAC AGTAAGAACT 
28251 GTTACTTTGT GGTTCCCAAA GAAGGCAGGA GATATTTTAT GGTAGTAATA 
28301 AATACAGAAA ACTTTTTTTT TTTTCCGAGA CGGAGTCTCG CTCTGTCGCC
28351 CAGGCTGGAG TGCAATGGCG CGATCTCTGC TCACTGCAAA CTCCACCTCC 
28401 CGGGTTCATG CCATTCTCCT GCCTCAGCCT CCCGAGTAGC TGGGACTACA
28451 GCCGCCCATC ACCACTCCCG GCTAA Mill TGTA I I I I I I TAGTAGAGAC 
28501 GAGGTTTCAC TGTGTTAGCT AGGATGGTCT CGATCTCCGG ACCTCGTGAT 
28551 CCGCCCGCCT CTGCCTCCCA AAGTGCTGGG ATTACAGGCG TGAGCCACCG 
28601 CGCCAGGCCG GAGAAAACTA TTTTAGTCCT GGTGTCAAGA ATCAGCTAAG 
28651 CTGTGTGTCA GAGGGAGGGG TACGTTAAGA AAGAGAAAAT TACTAATTCA 
28701 TTTGATGCTG TGAAAGTCAA AGCCCCAGAA TTTAGCTGTA ACTGAATGCC 
28751 TGGACTTACA ATATCAGGAG GAGCAGAAAG CCTCTCAAAG GAATCCATGA 
28801 CAGGGAAATG TTATCCATTG AGACAGAGAT TCTAAAATCA AGGAAAGTTA 
28851 AAGAGAAAGT GAATGAGCCT CTTTGCCATT TAATTTGACT AACATTGTTG 
28901 TATACCAGTC TAGATTGAGA ATGTTTAGAA AATAGACAAG TACAGAGTAT 
28951 GGGACTGTGT ATTGTCCATA TTTCTAATCT AGGTAAGATA GGAGAACAAG 

FIGURE 3J 



29001 AACAATTTTT TTTTTATTGA GATGGGGTCT CACTGTGTTG CCCAGGCTGG
29051 TCTCGAACTC CTGAGCTCAA ACAATCCTCC TACCTTGGCC TCCCAAATTG 
29101 CTGGGATTAC AGGTGCGAGC CACCTTACTC AGCCCAAGAA CAAATTTTGA 
29151 TGGAGATAAA GACAAGCATT AGAAGATCTA CTCATACCTC AGTCCTGGCA 
29201 CTTTGGGAGG CCAAGGAGGG CAGGTCACCG GAGGCCAGGA GTTTGATGCC 
29251 AGTCTGGCCA ACATGGCCAA ACCATGTCTT TACTAAAAAT ACAAAAATTA 
29301 GCTGGACCTG GTGGCCCATG CCTGTAATCC CAGCTCCTTG GGTGGCTGAG 
29351 GCACAAGAAT CGCATGAACT CGGGAGGTGA AGGTTGCAGT GAGCTCAGAC 
29401 CCTGCCACTG CACCGTAGCC CGGGTGACAG AGTGAGACTG TCTCAACAAA 
29451 AAAAAAAGAG AGAAGATCTA CTCATAAATT CCAAACAATG TGGCATGAAT 
29501 GGAGTGGCCT GATAACCCAA GCTCTAATGA CCAAATTTAA TAACTTTTAT
29551 TATTACCCCA TACATATTGT TTCTGTAAAT GTTAATATTA ATTTCTATTT 
29601 TTCTGAAAAA AAGTGATGTT ATATATTACT AGAAATATGC AAAGGGACTC 
29651 TGAAAAAATG Gl I I I I MCA TTTAAAGAAA TTGCATATTA A I I I I I CATC 
29701 AGTACTCTCA CTGTGTGTAA AATATCTCTG GCTAAAAAGT AAACTTACTG 
29751 TGTTATGAAA TGTAGCTTAT GTTTATACTC TTACAAGTAT CAGTATTAAT 
29801 GGTGTACAAT TTTTAAAAAA TTGAAGCTGT TTTATTTTGG TTAATTAAGA 
29851 GTGTAAAATA CAAAAAGTGA GATTCCAGGG AAAGTGCCCA CCAAGATCAA 
29901 GGATATCTGT GCCAATTAAA AGGAATGCTA TATTGCATAG AAATGAATGG 
29951 AGACCACCAG CTGGAGCCCA GAAGGCCAGA TCTGTAAGTC ATTCTAAACC 
30001 CTCCTTTGTG TTTTTTAGCT ATGGTATATG CTTTTTGTTT GTTTGTTTGT
30051 TTGTTTTGAG ACGGAGTCTC GCTCTGTCGC CAGGCTGGAG TGCAGTGGCG
30101 CGATCTCGGC TCACCGCAAA CTCCACCTCC CGGGTTCAAG CAATTCTTCT 
30151 GCCCCAGCCT CCTGAGTAGC TGGGACTACA GACGTGTGCC ACTATGCCCA 
30201 GCTAATTTTT GTATTTTTGG TAGAGATGGG GTTTCACCAT ATTGGCCAGA
30251 ATGGTCTCCA TCTCTTGACC TCGTGATCCA CCTGCCTGGG CCTCCCAAAG 
30301 TGCTGGGATT ACAGGTGTGA GCCATGGCGC CCGGCCCCGG CTAATTTTTA
30351 TACTTTTAGT AGAGACAGGG TTTCACCATG TTGGTCAGAC TGGTCTCGAA 
30401 CTCCTGACCT TGCGATCAGC CTGCCTCGGC CTCCCAAAGT GCTGGTATTA
30451 CAAGCATAAG CCACTGCACC CAGCTGTTAT ATT C1 1 I » I C" TTTAA I l i t I 
30501 TAATTAAAAA AAAAAI MM GTGGGTACAT AGTAAGTGTA TATATTTATG 
30551 GGGTATATGA GATGTTTTGA TACAGGCAAG CAATGTGAAA TAAGCACATC 
30601 ATGGAGAATA GGGTGTTTGT CCCCTCAAGT ATTTATCCTT TGAGTTACAA 
30651 ACAACCCAGT TATACTCTGT AACTTATTTC AAAATGTACA ATTAAGTTAC 
30701 TATTGACCAT AGGCAGTCTA TTGTGCTATC AAATAGTAGG TCTTATTCAT 
30751 TCTTTTGTTT TTTTAACCCA TTAAGCTATG GTATATTCTG ACAGACCTAT 
30801 CTGCACATGT TCATGAGGTA CAAGCTTATT GTTTGGAGTC CACAAATTTT 
30851 GTACTTAAAA TGAAGTATTC TGTACTGAGC ATTATAATGG TATTTTGTTG 
30901 GACAACTTCT AG M M I ATA TTTTATGAAA CAATGCTGTA TGCTCTTATA 
30951 AGTATACTTT AGGCTTAATT TTCTTTTTAT AACTGAAATT CTTCTAATTT 
31001 CTAATAAATA AGATTTTTCT GTATAGGAAA AGTGAGTAAC ATAGCAACAG
31051 AAAACACTCT GCATTTAATA TTCTTAATTC TAACATATTA TGTATAGGAT 
31101 TGAGAAGTTT TTATGATATA ATAATTGATA TTTCCCTAGT GATTCTTTGT 
31151 GTTTAATTAT TTGAATTCAC TTCAGCAGAG TGTTGAATCT TTTAGGTCAT 
31201 ACTAGTGAAA TGCTTCTGGT ATGTAAATGA TAAAATGGCT ACTGTCTTTT 
31251 AATTAAAGAA TTGTAI MM AAAGAAGGCT CATGGTTAAA TTAAGAACCA 
31301 TTTGGAAGTG TATTTACTAA GTGTTTACTT GATATATAGA CATTTTAGAA 
31351 AATGTGTTGG TATATAAACA M M M I AAA AACCGATTGT TTAAGTTATT 
31401 GCCCTTCATT TGATAAAGGG CTTTATTTAT TTATTTATTT ATTTATTTAT 
31451 TTATTTATTT ATTTATTTGA AAGAGGGTCC TGCTGTGTCA CCCAGGCTAG 
31501 AGGGCAGTGG CATGTCTCAG CTCACTGCAG CCTGGATGTA TTAGTCTGTT
31551 CTCATACTAC TATAAAGAAC TGCTTGAGAC TGGGTAGTTG ATAAAGACAA 
31601 GAGGTTTAAT TGGCTTACAG TTCTGCAGGC TGTACAGGAT GCATTGCTGG 
31651 GGAGGCCGCA GGAAACTTAT AATCATGGCA GAAGGGGAAG CAGGCTCATC 
31701 TTAAATGGCC AGAGCAGGAG AAAGAGAGCA AAGGGGGAGG TGCTACACAC 
31751 TTGTAAACAA CCAGATCTCT GGAGAACTTA CTATCACAAG AACAGTAAGA 
31801 GGGAAATCTG TCCCCATAAT CTAATCACCT TCCACCAGGC CCCTCCTCCA 
31851 ACATCAGGGA TTACAATTCA ACATGAAATT TGGGCAGGGA CACAAATCCA 

FIGURE 3K 



31901 AACCATATCA TTCCACCTTT GGCCCCTCCC AATTCCCATA TCCTTCTCAC
31951 ATTGCAAAAT ACAATTATCC CTTCTCAACA GTCCCCCAAG GCTTAACTCA
32001 TTTCAGCATT AACTCAAAAG TCCACAATTC AAGGTCTCTC TGAGACAAGT
32051 CAAGTCCCTT CCACCTGTGA GGCTGTAAAA TAAAAAACAA GTTAGTTACT
32101 TCCAAAATAC AATGAGGGTA CAGGCATTGG GTAAATACAC CCATTTCAAA
32151 AGGGAGAAAT CAGCCAAAAC AAAGGGTTTA TAGACCCCAT GCAAATTCAA
32201 AACCTAGCAG GGCAGTCATT AAATCTTAAA GCTCCAAATT CCTTTGACCC
32251 CATGTCTCAC ATCCAGGGCA TACTGGTGTG AGGAGTGGGC TCTCAAGGCC
32301 TTGGGCAGCT CTGCTCCTGA GGCTTTGCAG GCTACAGCCC CTGCGGCTGC
32351 TCTCACAGGC TGCTGTTGAG TGTCTGCGGC TTTTCCAGGT GCGTGGTGCA
32401 AGCTGTCGTT CAATCTACCG TTTTTGGAGT CAGGAGAATG GTGGCCCTCT
32451 TCTCACAGCT CCACTAAGCA GTGCCCCAGT GGGGACTCTG TGTGGAGGCT
32501 CCAATGCCAC ATTTCCCCTC TGCACTGCCC TAGTAGAGGG TCCCCCTGAA
32551 ACAGGCTTCT GCCTGGACGA CTAGGU I I I CCATACATCT TCTGAGATCT
32601 TGGTGGAGGC TCCCACGCCT CAACTCTTGC ACTCTGTGCA TCTGCAGACT
32651 TAACACCATG TGGAAGCCAC CAAGATTTAC GGCTTGCACC CTCTGAAGCA
32701 ATGGCCTGAG CTGTACCTTG GGCCGTTTTA ACCATGGCTG GAGCTGGAGC
32751 AGCCACAATA CAGGACACCA TGTCCTGAGG CTGCACAGAG CAGTGGGGCC
32801 CTGGGCTTGG TCCTCAAAGC CATTCTTCCC TCCTAGGCCT CTGGGCCTGT
32851 GATGAGAGGG GCTGCCTCAA AGGTCTCTGA AATGCCTTCA AGGCATTTCC
32901 CCCATTATCT TGGCTAACAA CATTTGACTC CTCTTTATTT TTGAAAATTT
32951 CTGCAGCTGG TTTGAATTGC TCCCCAGAAA ATGGGTTTTT CTTTCTAGGC
33001 TGCAAACTTT CCTAACTTTT ACACTCTGCT TCTCTTTTAA GTATAAGCTC
33051 TGGTTTTACA TCATTTATTT GCTCACAAAT ATGACCATAG GGTGCTAGAG
33101 CAGCCAGGCC ACATCTTGAA TACTTTGTTG CTTAGAAATT TTTTCTGTCA
33151 GACGCCTTAA ATCATCACTC TCAAAGTTCA AAGTTCCACA GATCCCCTAG
33201 GGTAGTGGCA CAATGCCTCC AACCTCTTTG CTAATTCATA ACAAAAGTGT
33251 CCTTTGCTGC ATTTCTCAAT AAGTTCCTCA TCTCCATCTG AGACCTCCTT
33301 AGCCTGGACT TTATTGACCA TATCACTATC AGCATTTTGG TCACTATGAT
33351 TTTAAGAAGT CTCTAGGGCA TTCCAAACTT TCCATCATCT TCCTATCTTC
33401 TTCTGAGCCC TCCACGCTCT TCCAACCTCC GCCCATTACC CAGTTCCAAA
33451 GTCACTTTCA CATTTTCAGG TATCTTTATA CAATACCCCA CTCCTGGTAT
33501 CAATGTACTG TGTTAGTCCA TTCTCATACT GCTATAAAGA ACACCTGAGA
33551 CTGGGTAATT TATAAAGAAA ATACATTTAA TTGGCTCACA GTTCTGCAGG
33601 CTGTACAGGA AGTATGGCTG GGGAGGCCTC AGGAAATTTA TAATCATAGC
33651 AGAAGGGGAG GCAGGCTCAT CTTACATGCA GGAGGAAAAG AGTGAAGGGG
33701 TAGCCGCTAC AAACTTTTGA ACAACCAGAT CTCATGAAAA CTCACTCACT
33751 ATCACAAGAA CAGCAAGGGG GGAATCTGCC CCAACGATCC ATTTACCAGG
33801 CCTCGTCTCC CAACATTGGG GATTACAGTG CAACATGAGA TTGGGCAGAG
33851 ACACAAATCC AAAGCATATC ACTCGACCTC CCAGGCTGAG ACACAAATCC
33901 AAAGCATATC ACTCGACCTC CCAGGCTCAA GTGATCCTAC CGTCTCAGCC
33951 TCCTGAATAG CTATACTACC GGTATGCACC ATGATGCCCA GCTAGTTTTT
34001 ACTTTTTGTA GAGTCAGGGT CTCACTGTGT TGCCCAGGCT GTTCTTGAAT
34051 TCCTGGGCTC TAGTGATATG CCCGCCTCAG CCTCCCAAAG TGCTGGGATT
34101 ATAGGCGTGA GCCACTGTGC CCAGCCTAAG GGCTTAATTT TATTAAAGAA
34151 ATAAGAAAAG TATGTTGTGA TTCAGAGGAC TCTTTATCAG ACCTGTAGAA
34201 GGGAAAACAC ATCTAAAAGA TTTGAGGATG AATTAAATTA CGAACTGTTG
34251 AACACGCTGA CATTTTTCCA GTTCCTTGAA AAGGTAAAAT TGATTTCCAC
34301 AGGAACTACC TCTGATATTC CTATTACTGT TGGGATGTTA GAGAACATTT
34351 TAAAGAAAAT GTTTATTGCC TTTCAATACT TTTCTATATT TTTTACCACT
34401 TTTCAACAAG TCATTAGTAG CATTTTCTTC TAGGTTGTAT ATAGGTGAAA
34451 TTGTAAAACA AAGAAAACTA CTTCTTGTTT TAAAAGATTT TAAAAATAGG
34501 CAGGTGCAGT GGCTCACGCC TATAATCCAA CACTTTGGGA GGCTGAGGCA
34551 GGAGGATCAT TTCAGCCCAG GAGTTCGAGA CCAGCCTGGT CAACACATTG
34601 AGACCCCACC TCTACAAAAA GTAAAATTAA AAAAAAAATT TTTTG Mill
34651 TACTGGACAC AGTAGCATGT GCCTGTAGTC CCAGTTACTT GGGAGGATGA
34701 GGCAGGAGAT CCCTGGATCC CAGGAGTTTG AAGCTGCGAT GAGCTATGAT
34751 CACACCACAG TCCTGCAGGC TGGGTGACAG AGTGAGATCC TGTCTCAGAA
34801 TTTAAAAAGA AAAGAAAATA TTTTAAAAAT AAACATATAA TTTGTATTTA
34851 GATTAATGAA CTAAATTTTA TACATTTACT TAAATATTTA AATAGAACTA
34901 TATGAAAGTG CCATTTTTCT AGATTAATTA TGGTCAATTC TGGGCAATTT
34951 CTTTTTTTGA GACGGAGTCT CACTCTGTCA CCCAGGCTGG AGTGCAGTGG
35001 TACGATCTTG GCTCACTGCA AGCTCCGCCT CCCGGGTTTG TGCCATTCTC
35051 CCGAGTAGCT GGGACTATAG GCACCCGCCA TCACGCCCAG CTAATTTTGT
35101 TTTTGTGTTT TTAGTAGAGA CGGGGTTTCA CCTTGTTAGC CATGGTGGTC
35151 TCGATCTCCT GACCTCGTGA TCCGCCTGCC TCAGCCTCCC AAAGTTCTGG
35201 GATTACAGGC ATGAGCCACT GCGCCCAGCC AATTATGTGC AATTTCATAT
35251 GGTCCAATCT AACATATATG TGAACCATAT AGCAGTAAAA ACAACAAAGA
35301 ATATAACATG TTACCTCTTT ACATGAGGAC ATTTTGGTTT TAATTGTTCT
35351 TGTTATTCAT ATTCCCAACT ATTAGTTCCT AGGTCTTTCC AGTAGTTTTA
35401 TCTTTTTTTC TCTTTTTATT ATTAACTGTA AACTGTAAAC TAGACAGAGT
35451 TGCCACGCTT TAGGTTAAAT TGACCCCACT TTGCTCTTTA GCAAGAAGGT
35501 CTTGACTGGC TTTTATATCT TAATTTGATC TGTTTCTTGT CTTCTAGCTC
35551 AGTGGCTTCT ACTCAGTTGG AAGATAAACT GTCATTTCTG GTTCTCCTAT
35601 TCTCATTCTG TTCTGGTTGG GAAGGGTGGT GAGGGCTGGG ATGGTGATAT
35651 GCCCATCATG GCTGTTATAT GACCI Mill AATATTTTCT CTGGAAGAAT
35701 GATTCTGATT CAGCATCTTC TTTCCTTTAA GTCATGATGC CATTTTGCAT
35751 TTAGTCAATT TATCAGAAAC TAAAAATGTT GCAAATCCCC ATATGTGTGA
35801 GTTTCACTAT GCTTTTTATT TCCCTGTAAA GTATGGTAAG GTATAAATGA
35851 GTTTATGAAA AATAGAAAAC AATAATTCTG AGTTTAGTTT TGGATCTTGG
35901 GTTGCCTGGG CATACTCACT AGCTAAGTAT TTTTCACATA CTAGCCATGA
35951 AGTATGCATG ATTCATATCC ATACCTTAGC AAAATTGTAA ACCACTATAC
36001 TATCTAGTAC TTAGGTCTTT TTGTACTCTA GGATTTGGGG ACTCTTAAGA
36051 TTATTCTGGA AAAAAAAGTA TAGAAGAAAA ACAGCAAAAA TACACCTTCA
36101 GTGCCTTATC TTAGCTATGG TCACTGTTAT ATTGTCAAGT ATTATAAATT
36151 TGTATTATGG TTTTTTTTTT TGAGATGGAG TCTCGCTCAC ATTGTGCAGG
36201 TTGGAGTGCA GTGGCATGAT CTCAGCTCAC TGCAACCTCC ACCTCCTGGG
36251 TTCAAGTGAT TCTCCTTCCT CAGCCTCCCA AGTAGCTGGG ATTACAGGCG
36301 TGCGCCACCA TGCCTGGATA ATTTTTGTAT TTTTAGTACA GACGAGGTTT
36351 TGCCATGTTG GCCAGGCTGG TCTTGAACTC CTGACCTCAG GTGATCCACC
36401 CGCCTCAGCC TCCCAAAGTG CTAGGGTTAC AGGTGTGAGC CACTGCACCC
36451 AGCCTGTATT ATGGTTTTTA AAAACATCCC CTCTTGTTTT CTTCAGATAA
36501 AAATGATAGA AAGACCCAAA ATTGCTGCTG TCTGTGGACA TTATGATTAT
36551 TATTATGCTC AACTTGATAT GCTGAGGAGG AGAGCCCACA AACCAAGTTA
36601 TCACCCTATT CCTCAAGAAA ATACTGGAGT TGAGGATTAC GGTCAGGAAA
36651 CGAGGCATGG TCCATCCCCA AGTCAATGGT AATATTGTGG TCTAGCTTAA
36701 GCTTTGGTTA ATCTAAAAAT ATCTTTATAT ATTAACATTT ATTATTCTGA
36751 AATCCAAATT CTCCTAACAC AAATAATCCA AGAAGAACTT TCCAAATCTT
36801 CATTTTAAAC ACATAGTTCC CTTGACCTTT TTCTTTTGTT TGCTTTTGTA
36851 GACAGTCTCA CTCTGATGCC TAGGCTAGAG TGCGGTGGCG CAATCTCAGC
36901 TCACTGCAAC CTCTGCCTTC TGAGTTCAAG CGATTCTCGT GCCTCGGCCT
36951 CTCCAGTAGC TGGGACTACA GGCGTGCACC ACCATGCCCA ACTGA Mi ll
37001 ATATTTTTAG TAAAGACAGG GTTTCACCAT GTTGGCCAAC CTGATCTTGA
37051 ACTCCTGACC TCAGGTGATC TGCCCGCCTC AGCCTCCCAA AGTGCTGGGA
37101 TTACAGGCAT GAGCCACCAT GCCTGGCCAT GTTAGTCCCT TCTTTCTATG
37151 TCAGCCCTAT ACCTGCTTGT TAGTTGGTTC TTCAAATTCT CAGGTACCCT
37201 CTCACCAGGC AGCCACTGAC CTCATGTGAT CCACCTGCCT TGGCCTCCTA
37251 AAGTGCTGGG ACTACAGGCA AGAGCCACTA TTCCCAGCCT TTCTTTCTTT
37301 TTTTTTTGTT AGAAAGATTT TGTTTTTATT TCCATCAGAA TGTCATATAT
37351 GTTACACAAA TCAAATCTGT TGACATCTCA AGCTTATAAC AATTACGTGT
37401 TCTTATAAAT TACGTGGGAA TTACATGTAC TGTGAGAAGT GTTGTAATTA
37451 TGATGTAATG TATATTATAA TTTAGCCTAC AGAAGTAACA AAGTCTTGTA
37501 ATTAAATAAA GCAATAAATG TGTTGATAGA TTATTACAAT TGATAAGTAA
37551 TTGATAAATT ATCTTCTTTT TCCTGTAACC CTTCTTCATC TCAAGTCTGA
37601 TCTAGCTTAT TTTCTTATTC ATAGAGCTGC TTAACTGTAG GCACAGACCC
37651 ATACCCTTGC TCTTTTAATA TTCTTTCTTC CTCCTACTAA ATTCCACTAT
37701 ATGGCAGGTG AAAAAATAGT TGTGTATATT TCATTTCTCA AAGAGGTTAC
37751 TAATATGAAT CAATAATTGA ATCATTAAAA TCAAATGATC ATTTGAGACA
37801 TTTTGAGAAA TAAGATATAT TTCATTCGGC ATTTATGTTC TAGGGATTTT
37851 CAAAATATGG ACATGTTAGA AAGAAAATAG TATTCTTAAA TTGGTCTTAT
37901 GGTAGATTTT CAAAAAATTT ACTCCATAAT AGATTTCTGC AGATCTACAA
37951 TATTTTCAAA TTTTTTTCAC ACTGATGTTG ACATTCCTGT GTTCAGAATA
38001 ATTGACACCT AACAGAGGCC TGAAGACTTA AGTCTAAGAG TTCTATTTTA
38051 AAAATGTTTT GTCATCAATT TTTTTTGTTC AGGGTTAAGA ATTTGTTACT
38101 TTGGCGACTC TGATTGTTTA TTGTTGAAAT TTTGGATGAA TTATGAAAAA
38151 CACAAGATAC TATGGGATGG CAATCTCATT AATAGTGAAA ATGAGATAAG
38201 CAAGAATGAT AAGAGAATAA TTTCTTCAGA AACATATAAC GGGAAAAGCA
38251 TATGTTTTAT CTTTAAAGAT ATTGGGCACT GTTGTGGTTT TTATCGATCT
38301 TCATAACACA Mil IAATTA TCTCCACAAT TATCAAAAGT TATGTCTTCT
38351 GTCCGTTCAA ATTGTAATAT CCATATTGGA CTCAATTAGT GAGGGACATA
38401 GATTTTACAG AAGAACTGGA GCAGCCACAA AACTCCCTTT CTCTTTTCTC
38451 AACCACTGGG AAAATATATC TTCCACTCTT TGTCTAGATT TGAGAGCTGT
38501 CAAGCTATCA ATTATTTTGA CCACATGTGA TTTTATATCT CCCAAGCTCT
38551 CACATGAAAC AGTAGGAAGG GTCCTTCTCT TTCCTGGATG CCCCTTTGAT
38601 GCCTGGTAAC CCCTCCTCTT TGTATACTCC CTCATCCCCA GCTTTCTGTC
38651 TGCTGGAGGT CCAATTACAG GCCATGGGAT GGAGGAAAAG GATTTTTTTT
38701 TTTTTTGAGA CAGAGTCTCG CTCTGTCGCC CAGGCTGGAG TGCAGTGACG
38751 CGATCTTGGC TCACTGCAAG CTCCGCCCCC TGGGTTCACA CCATTCTGCT
38801 GCCTCAGTCT CCAGAGTAGC TGGGACTACA GGCGCCTGCC ACGACACCTG
38851 GTTAATTTTT TTGTATTTTT AGAAGAGACA GGGTTTCACT GTGTCAGCCA
38901 GGAGGGTCTC AATCTCCTGA CCTTGTGATC CACCCGCCTC GGCCTCCCAA
38951 AGTGCTGGCA TTACAGGCAT GAGCCACCAT GCCCGGCCGA GGAAGAGGAA
39001 TTTGTATAGG ATTTGGGGGG TGGAGAGGGA ATAGGTAGAC AGAGAGATAG
39051 AGAATGTCTT TTGGACAGCC CCTGGGTGTT GGAATCATTT TTCTCATGAA
39101 GATATTGATA CATGTGCCAG TTAGGCTTAT GAGACAGATG AGTGCTACAA
39151 TTTACCCTCA TTTGATTCAA GAACTATCTG TGTGTCAGGC ACTTCACAGC
39201 CCTAATCTCT TCTAATCCTC GTATCAATTC CTGTGAAATG GTACCATGCC
39251 CACTTTACAG CTGAGGAACT AAGACTCAAA GACTTTAGCT TGTCATTTCA
39301 TCCTATTTCT AAATCCCTGA TTATTAACTT GCCTCTTTGT AAATTGGGGA
39351 TGCTTATCAT GATGTTCCTT CCTAAAGGAG TTATTTCTGA AATTACAGTT
39401 CTGTCTTTGG AGCCTTAGAA GTTACTCGTA TTCCAAAAAA CTTATGGTCT
39451 GAAATGCGGT TTTTATTTAG CAACCAATAA TTACAGAAAT GTTTTACAGG
39501 AAATTCTGCC AAAAAAAAGA TACATAAAAT GTGAGTATAA ACTTGAAAAT
39551 TGTTTGACTG GAATTGACTA AAATTGTGCT GGAAAAATAC CTTAAACATT
39601 TGGAGAGACA GCTAAACCAT TATTTCTTTC CTCATTAAGC ATTTATGTGC
39651 GGAGATAAAG GGATGGATGG AGGGACACAT TCTGCTCTCA GGGAGCTCAG
39701 TATGTGGTGC AGGAAACAGA TATGCAGCCA TTCTTTTTTT TCTTTTCTTT
39751 TC I I I I I II C I 1 I I I I I I I GAGATGGAGT CTCACTCTGT CACTCAGGCT
39801 GGAGTGCAGT GGTGCGATTT TGGCTTACTG CAACCTCTGC CTCCCTGCTC
39851 AGCCTCCCCA GTAGCTGGGA TTACAGGTGC CCACCACCAC GCCCCACTAA
39901 I I I I I I MAG CAGAGACGGG GTTTCACCAT GTTGGCCAGG CTGATCTCGA
39951 ACTCCTGACC TCGTGATCCA CCCACCTCGG CCTCCCAAAG TGCTGGGGTT
40001 ACAGGTGTGA GCCACCACAT CTAGCCACTA TTGTTAAATC AGGAGCGACA
40051 ACCTGTACAT TAGACACCTA CACAAAGCGT GAGAACTTCT GGGTGTGGGT
40101 CTGTTTTCCT CCCCACAACA TATTATAGAG AATGGAAGGA CTGAATCTTG
40151 TCCTGAAGAA AAATCACTGG ATAAGAATAT TTTTCTGTTT AATCCTCTCC
40201 TGTATCCCCA CTTGTTACTC TTCATCCTTT TTTCCTTTTG ATTCCAAAAT
40251 TTTCTTTTCC AATGTAAAGA TTCTGTAACT GTGAACTACT TCTTGAACTT
40301 GGAACTTCAA GCCACTGGTG AATTGTGAAT CTCATTACTA AACTGAAAAT
40351 TACTCGTCAA ATTGGTGCCT AAGATTCGTT CAAGTTTCTA CTTAAGCTGA
40401 ACATTCTTAT TTTCTAAGGC CTGCTGAGTA CCTTCAGAGA AAATTTGAAG
40451 CTCAACAATA TAAGTTGAAA GTGGAGAAGC AATTGGTAAG TAAAATACCA
40501 AATATGGGAA GCAATTAGGA ATTTCCTAAT AGTTTTTCTG TTCACAGATT
40551 TTCAAGTCAA AGTTCATTCC ACCAGAAGGT CAAGAATACT CTCTACTAGT



FIGURE 3N 



40601 CCCCAGTTTT TTTTGTTTTT
40651 TGAGACAGAG TCTCGCTCTG 
40701 TGGCTCACTG CAACCTCTGC 
40751 GCCTCCTGAG TAGCTGGGAT 
40801 TTCTGTATTT TTAGTAGAGA 
40851 TCGAACTCCT GATCTCGGGT 
40901 GGGGTTACAG ACGTGAGCCA 
40951 TAGCTAAATA AAATAATGGG 
41001 GGTTTCTTCC TTGGCTCTAT 
41051 AGCCTATCTG AGCCTTAATT
41101 AGATGAGCTT CAAATTTCCT 
41151 GTTAAAGATT CTGCTGCACA 
41201 TGGCTCATGC CTGTAATCCC 
41251 ATAAGATCAG GAGTTCAAGA 
41301 TCTACTAAAA ATACAAAAAT 
41351 CCCAGCTACT TGGGAGGCTG 
41401 GGAGGTTGCA GTGAGCCGAG 
41451 AGAACGAGAC TCTGTCTCAA 
41501 AAAAAGTGAT TAGGCCAGAT 
41551 TTGGGAGGCT GAGGTGGGTG 
41601 CCTAGGCAAC ATAATGAGAC 
41651 CCAGGTGTTG TGGTGTGTGC 
41701 CCGGAGGATT GCTTAAGCTT 
41751 GCGCCACTGT ACTCCAGCCT 
41801 AAAAAAAAAA AAGAAAGGCT 
41851 CACTTTGGGA GGCCAAGGCG 
41901 ATCCTGGCTA ACACAGTGAA 
41951 AGCCGGGTGT GGTGGCGGGT 
42001 GACAGGACAA TTGCTTGAGC 
42051 TCATGCCGCT GTACTCCAGC 
42101 GAAAAAAATA TATA I II I I I 
42151 TTACCACAAA AAAAACTCCC 
42201 TTATTTATAG GAAAAAGTCA 
42251 AAGTATGCAT TATAAATTAT 
42301 TTCTGCCGAG CCAAATTACA 
42351 AAGAGCCTAG ATTCCAGGAG 
42401 CAGGTTAAAA ACTGTTTAAT 
42451 ACTGTCTTTT GTACTGTAAT 
42501 CTGAATATGT CAACACCATG 
42551 ATTTCACTTT CTCAGATAAT 
42601 CCTTGAGATT CATAGAGTAA 
42651 ATTAGTTTCC TGGGCTGCTG 
42701 CACAATAGAA ATGTATTGTC 
42751 TCAAGGTGTC AGCAGGACCA 
42801 TCCTTGCCTC TTCCTAGCTT 
42851 CTTTACTTGC AGTCGCATCA 
42901 CTTCCTGTGT GTCCCTGTCT 
42951 GTCATATTGA ACTAAGGGCC 
43001 ATCTGCAGCG ACCCTGTTTT 
43051 GTTAGGACTT CAACATATCT 
43101 AAGAGGGTTA TATATAAAAG 
43151 ACACATTTGG GTTATATATA 
43201 TCACAGGGAG TGAAGTACTG 
43251 TAGGAGCAAC ATA I I I I I CT 
43301 TTGTAGGCTG CAGAAAGATT 
43351 AGTTTCTGGA TTGGGTTTGC 
43401 AATACGCCAA CAGTACCACA 
43451 GGAGAGAACC AGAGGTAAAT 



GTTTTTGTTG TTGTTGTTGT TGTTGTTTTC 
TCACCAGGCT GGAGTGCAGT GGTGTGATCT 
CTCCCAGGTT CAAGCAATTC TCCTTCCCCA 
TACAGGCGCC CACCACCACG CCCAGCTAAC 
CAGGGTTTCA CCATGTTGTC CAGGCTGGTC 
GATCCACCCA CCTAGGCCTC CCAAAGTGCT 
CTGCACCTGG CCCGAGTCCC CAGI I I I IAA 
AACAGGCTTG AATCACCCCC TTAGCAGTCC 
CTCTTCTGTG GGACCTTGGA CAGTTCATTC 
TCCTTTTCTA TAAATGACAA I I I I IAGAGT 
TGCAGTGCTG TAGTGCTTTG GTTCTATTTT 
TTAAAAAAAG TGACAAGGGG CCAGGTGCGG 
AGCACTTTGG GAGGCCAAGG TGGGCGGATC 
TGAGCCTGAC CAACATGGTG AAAGCCCGTC 
TAGCCAGGCA TGATGGTGCA CACCTGTAAT 
AGGCAGGAGA ATTACTTGAA CCCAGGAGGA 
ATCGCCCTAC TGCACTCTAG CCTGGGCGAC 
AAAAAACAAA AAAAACAAAA ACCAACAACA 
ATTATGGCTC ATGCCTGTAA TCCCAGCACT 
GATTGCTTGA GCCCAGGAGT TCGAGACTAG 
CTTATCTCTA CCAAAAAAAA CAAAAATTAC 
CTGTAGTCCC AGCTACTGAG GGGGCTGAGG 
GGGAGGCAAA GGTTACAGTG AGCTAAGATT 
GGGTGACAGA GTGAGACTCT GTCTTAAAAA 
GGGCTTGATG GCTCATGCCT GTAATCCCAG 
GGCAGATCAC GAGGTCAGGA GATTGAGACC 
ACCCTGTCTC TACTGAAAAT ACAAAAAATT 
GCCTGTAGTC CCAGCTACTC GGAAGGCTGA 
CTAGGAGTTG GAGGCTGCAG TGAGCCAAGA 
CTGGGTGACA GAGTGAGACG CTCTCAAACA 
AATGCTTTAT AATTAAGAAA ATTCTACTAC 
AAATACTGAG TTTGCTTAGT GATATAATTC 
ATGTCAAATC AGAAGATGTT TCCGAAATCA 
TTCATTCAAT AAATAGGGTC TTCGTCCATC 
ACCAGAGACA AGAGCTAAGA AGTAATGGAG 
CTGCCATTTA GGAAAAACGA AATGAAGGAA 
TCCAGGGCTA CCCTTGTATT TCTTTGTATT 
AGGGAGTTAC TTCTATTTCC TACAGTGCCC 
CTGAGTGTTA TAGGGGATAC AGAGTTAGGT 
GCGATATGGC TAAGTTCATA AAGCTTTTCA 
CTGTCCATCA GTAACAGGTT TGGGATTTGT 
TAACAGAGTT CAACAAACTA GGTGGCTTAA 
TCACAATTCT GGAAGAGTGG AAATAGAAAA 
AGTGCCCTCT GAAACCTGTA GGGGAATCCT 
CTGGTGCGTC ACTGGCAATC TTTGGCATTT 
CTCCATTCTC TGCCTTCATC ACATGCTGTT 
TCACATGGCC ATCTTCTTGC AAGGACATCA 
CACTGGTATG ATCTCATCTT AAGTAGTCTC 
CAAATGGGGT CACATTCTAA GGCACTGGGG 
C I I I I I GGGG AGGAACAAAC TTCAGTGCAT 
TGGGATTTAT AAAGTAAGTG TACATCATGA 
AAATTGAGCT CTGTAGCTAA AGCCACTGTC 
CAGCCAAAAC ATAAGGCAGA TTATCATCTT 
AACCTTATTT TATATTACAC ACTTTTGAAA 
A I I I I I GTTA TGGTGTTCAT AAACATTTAA 
TTTCCAGGAA TATTGGAAGC AGTTAGAGGA 
ATGACATGAA AGAAATTAGA AAGAAGATGG 
TCATTCTTCT AGGGGAAACA TTGTTCTATC 



FIGURE 3O



43501 GATTTAGAGC TAACTAAATT GAGCTGGTAT TAAAAGTAAT GATTTCCTTA
43551 TAGAAAAGAT AAAGTTTTAT CATAGAGATA ATCATGTAGA CTTCTTTTTT
43601 AATAGGAAAG CTGTCAGACC TCATTGGAGC TTCAGTTTAT TATGGTTTAT
43651 GAGAGACTAC ACAAGATAAT AAGGATATCT GAGATTCTCA GGAATGGCTA
43701 TTATTAAAAG TACTTATTGA TTGTTTCCTT CATGAATCAC TCAATACATA
43751 TTTATTGAGT GGCAACTTTA GACAAGAGCT GGATTAGATG CAGAAAGTCC
43801 AAAATGAGTG TAAGTTCATG CCCAGAAGGT GGGAAAAAAA CAAACCCAAC
43851 ACACTAGCAT Mill CAACT CTCTGCAGGG TAAGGTTCTA AAGGCTATTT
43901 AAGGCAAAAT CTGAGTGCAG TTGAACTGAT TCTTAAAAAT CTCTTAAAGG
43951 CGCCACATTG GAAATTCATC CTTCCATCTC CCAAGAAGGT CTCTAGAGTT
44001 GGCACAGATC ACTGCTTCTT CAGAAGAGCT TCACATGAAA TAGCCAGCCT
44051 GTGTTTGGAA ACCATGTTGT AAGAAAGACA CATGGCTATT GAAACACTAG
44101 GAACACACTC AGTGCCCTGG AATGCTCTCC TAGGAGAAGC TTGCAGGCAC
44151 TGAGACAGCT GTCTCCCATC CCACATGCAC TTGGCCACAC ACTCATTGAG
44201 TAGAGCTACC ATGCTGCTGA AATTGATCTC TCTCTCTCTT TCTCCCACCG
44251 CAGTGCATAC AGATAAATTC ATATAAGTCA AATGAATGTA TGGTGCAATT
44301 CAGTTGTGTT TGCCAGGCCA TGAACTAGAG CTTTCACATA CTGTATTAGT
44351 CTGCTCTCAT ACTGCTAATA AAGACATACC CAAGACTGGG TAATTTATAA
44401 AGAAAAAGAG GTGTAATAGA CGCACAGTTT CACATGGCTG GGGAGGCCTC
44451 ACAATCATGG CAGAAGGCAA AGGAGGAGCA AAGTCATGCC TTACATGAAG
44501 GCAGGCAAGA GAGCTTGTGT AGGGGAACTC CTATTTACAA AACCATCAGA
44551 TCTTGTGAGA CTTACTCACT ACCATTAAAA TAGTATGGGA GAAACCACCC
44601 CGATGATTCA GTTATCTCCA CCTGGCCCCA CCCTTGACAA ATGAGGATTA
44651 TTACAATTCA AGGTGAGATT TGGGTGGGGA TACAGAGCCA AACCATATCA
44701 CGTTCATACT TTCTTTTATT TACCCCTGTA CAGAGAAGTT AAGTAGCTCA
44751 TCCAAAGTCA CGTAGCTATT ACAAGGCAGA CGAAATATTT AAATATCTGA
44801 CTCTAGGCTG GGCACGGTAG CTCATGCCTG TAATCCCAGC AATTTGAAAG
44851 GCTGAGGTGG GAGGATTGCT TGAGCCTAGG AGTTTGAGAC CAGCCTGGGC
44901 AACATAGGGA AACCCTAGCT CTAAACACAC ACACACACAC ACACACACAC
44951 ACACACACAC ACACACACAC ACACACACAC TCTCTCTCTC TCTCTCTCTC
45001 TCTCTCACTC TCTCTCTCTC TCTCTCTCTC TTTAAATTAG CCGGACATGG
45051 TGGTTTGCAC CTGTAGTCCT AGCTACTTGG GAGGCTAAAG CAGAAGGATT
45101 GCTTGAGCTA GGAGCTAAAG GCTGCAGTGA GCCATGATTG TGCCACTGTA
45151 CCCCAGCCTG GGATACAGAG CAAGACTCGG TCTCAAAAAA ATAAAGTAAA
45201 ATAAAATGAA AATCTGACTC TAAAACCCCT ACTCATGTTC ATGCCTGTAA
45251 TCCTAGCATT TTGGGAGGCC AAGGCAGAAG GATCGCTTGA GCCCAGGAGT
45301 TTGAGACCGG CCTGGGCAAC ATAATGAGAC TCCATATGTA CAAAAAATTT
45351 AAAAAATTAG TGGGTCATGG TGGCAAATGC TTGTAGTCCC AGCTACTCAG
45401 GAGGCTGAGT TGGGAGGCTG AGGTTGAAGC TGCTGTGAAC TGTGATTTTT
45451 CCACTGCACT CCAGCCTGGG CAACAGAGGG AGGCCCTGTC CCAAAAAATA
45501 AAAAAATACA ATTATAACCA CTATTCTTTC TGGCATATGC AGTTCTACTT
45551 ATAAATGGTT GGAATGACGA GACACATGTA TAAAACAATC ATAGAGTAAG
45601 CCCTGTAGGT GCACAGGAGC CAAGATGAGC AGAATGAGCA GGTAGCGGGT
45651 ATTTATGGAA GAAGTGGGTG GGGCCTAAAG AGTGGAATTT GGAGAGGCAG
45701 AGTTAAAGGA GGAGAGTGGG CATTCTGAAA GAACCATGCA AAGGTTGGGA
45751 CAGAGGACTG TGTGTGCTGG GAGGGGCAGC CTGGAGGTTG TTCTCTCTGG
45801 AGCAGAGGCC TGGCTCATGG GCAGCCTGGA GCCATCATCA GCCTTATGTC
45851 TAAGGCTGAT CTGGGATGGG CAGCCTGAGC CCTGCAAAAA TGGATAGCAG
45901 ACATTGGTCA GGCGCGGTAG CTCACGCCTG TAATCCCCAG CACTTTGGGA
45951 GGCTGAGGTG GGCGGATCAC GAGGTCGGGA GATCGAGATC ATCCTGGCTA
46001 ACACAGTGAA ACCCTGTCTC TACTAAAAAG CCAAAAAATT AGCCAGGCGT
46051 GGTGGTGCGT GCCTATAGTC CCAGCTACTT GGGAGGCTAA GGCAGGAGAA
46101 TGGCATGAAC TCAGGAGGCA GAGCTTGCAG TGAGCCAAGA TCGCACCACT
46151 GCACTCCAGC CTGGGTGACA GAGTGAGACT CCATCTCAAA AAAAAAAAAA
46201 TGGTGGATAT CAGACATTTC TACAGAGGCT ATGGAGGAAG GATTTGGGAA
46251 ATACTGAAGC TGTGGCGGGG AAAAGGAGCT ATAAAAAGGG TTCTGTCCAG
46301 CCATTTCATC ATTGATCGCG CATCAGCCAA GTAGCCTTCA GAGCTCACTC
46351 AGAACCAATC TTGTTGACTG TGTATAATTT GTGTTAAAGG AGAACTCAAA



FIGURE 3P 



46401 AATAAGTCAT AAAACCTATT TGGTGAAGAA GAGTAACCTG CCTGTCCATC
46451 AAGATGCATC TGAGGGAGAA GCACCTGTGC AGGTAATGAT GGCTATGATC
46501 AGATGTGTGT GCTTGCAGTG TGTGTGCTCT ACCCCAAGTG GCTCTTACCC
46551 TTCTCTGTGC AGCAGAACCA CGTAGGGAAC TTTTTTTTTT TTTCTATGAG
46601 ACGGAGTCTC ACTCTGTCGC CAGGTTGGAG TGCAGTCGTG CGATCTTGGC
46651 TCACTGCAGT GTCTGCCTCC TGGGTTCAAG TGATTTCCTG CTTCAGGCTC
46701 CCGAGTAGCT GGGACTACAG GCATGCGCCA CCATGCCCAG CTAATTTTTG
46751 TATTTTTAGT AGAGACAGGA TTTCACCATG TTGGCCAGGA TGGTCTTTAT
46801 CTCTTGACCT CGTGATCCAC CAGCCTCGGC CTCCTAAAAT GCTGGGATTA
46851 TAGGCATGAG CCACCGCCTT TTTACCAGAA CTTTTAAAAA CTCAGATGCC
46901 TCTCTCTGCC CCAGATGGTC TGGGATGAGG CCCAGGCATT CTGTCTGCAG
46951 AAGCTTGCTG GGTGATTTAG TAAGCAGCCA AGTTTGAGAA CTGCTGCTAT
47001 TTAGTATAAG AACGTTCCAC TCTCTGGAGG GTCTAAGTCA GTGTATCAGA
47051 CACATTGGTC AGGAAATCTG AGTCAAGTTC TCTTCCATTT CAACCTTATG
47101 TTTTTGGTGG AGGTAAGAGC CTGGGCAGAG TTGAAATAAC AAATAAATCT
47151 CAAGAGAGTT TTTTTCCCTT CTGAGAAAAG ATAATGCAAT TATAATACAA
47201 GATGAATCTG TTGATTTCAA CCAATTCTGA GAATTATTAA ACCTGTGAAA
47251 TGACCTGATA AACAATGCTT TTATGGTTAC ATAAAATAAT TACATAAAAT
47301 GTTTTACTTT CCAAGGAGTT ATATTTATTT TGCGAATAAG AAGCCCAAGT
47351 GTGC M I I AT TTTTGCTTAG TAAGAAGATT CTCAATGATT TGGCCCATAC
47401 TAAGAATTAT TATTATCTTT I M I I I IAGA TGGAATTTCG CTCTTGTTGC
47451 CCAGGCTGGA GTGCAATGGC ACGATCTTGG CTCACCGCAA CCTCCGCCTC
47501 CCAGGTTCAA GCGATTCTCC TGCCTCAGCC TCCTGAGTAG CTGGAATTAT
47551 AGGCGCCTGC CACCGCGCCC AGCTAATTTT TGTATTTTAG TAGAGACAGG
47601 GTCTCACCAT GTTGGCCAGG CTGGTCTTGA ACTCCTGACC TCGGGTGATC
47651 CACCTGCCTC AGCCTCCCAA AGTGCTGGGA TTATAGGCAT GAGCCACCCC
47701 GCCTGAGCGA ATTATTATTA TCTTTATAAT TAGAGTAATT CTCTGTGTTT
47751 TAAATTATAT TTATTATTAG AGCTTGGTCG AGAGTCAACT AGAAATGGAA
47801 AATCCTCAAG GTATTATAAA CTTGTCATTT AAAGGTGCGA GTAGGATCAC
47851 AGTCACATTC CATAAAAACA CGGCTCAGAT GTTACAGACA TGTTTTTCTC
47901 TCACAI MM TAACCTGGTT AGAGTAAATC CAGTGCCTTA AAGTTTTTAA
47951 TAAGTCAGGT AATTAAAAAT AAACCACTGG AAGCCTCAAA AAGTTTGTAT
48001 CAGGAATTGG GTGAATAAAA TCTTGTATAT TTTATGCAAG AGGAGTAACT
48051 TTGAAAGAAA ACACACCAAA ATGCCAATGG TGGTAATTGG TGGTATCTGG
48101 ATTGGTGTGA GTAGGAATGA TTATTGTCTC TCTACTTTTT AGAI MIMA
48151 TAAGAAGGTT ACAGAACTTT TACTACAAAT ATGTATAATA AAGTATCCGT
48201 TCCTTAGTTC TGTCAGCACT CTAATCAATA TCTTCAAACA AAAAAGCCAT
48251 CTGAAAGACA GAAATGGTGG CACGAGACTA TAGTTCCAGC TATTTAGGAG
48301 GCCGAGGATC CCTTGAGCTC AGGAGTTTGA GACCAGCCTT GGTAATATAG
48351 TGAGACCCCA TCTCTAAAAA AAAAGAAAAG GCATCTGATA TTTCCTGAAG
48401 GCTCCTCCAG AGCAATCCAG CAGCAGATAC CTTTGCAAAC TTTTGTAAAG
48451 GAAATAATTA TCACTTAATT TGTCTAATTT TTGGATTTAG GTTTTAATTA
48501 TC I I M II GA AGGGAATATG CAGCTATATA ATAAGACACT TTAAAAAAGT
48551 CTCTACTTGT AGAGTTATCT TTCCAAAATA CTGATTTGAA CATTATTTCT
48601 CTACACGACA ATCAATGGCG ACTGCCATTT CTCTTAGCAT GGCATGCTAG
48651 ACTTTTGTGA GTTGTTCCTA ACAGAATGTT CCAGCCTCAT TGCTCACATT
48701 TCCCCCAAAC ATACCCAAAG CTCTAAATGT CTCAGATTAC CTTTTTTTTT
48751 TTTAAATGAC ATA I II I MA TTTCTTTAAG TGATTTTTTT CACTGTGGTA
48801 AAATACATAT AACATCGCCT TTACCACCCT AACCA Mill I I II II I I II
48851 TTAATTGATC ATTCTTGGGT GTTTCTCGCA GAGGGGTATT TGGCAGGGTC
48901 ATAGGACAAC AGTGGAGGGA AGGTCAGCAG ACAAACAAGT GAACAAAGGT
48951 CTCTGGTTTT CCTAGGCAGA GGACCCTGCG GCCTTCCGCA GTGTTTGTGT
49001 CCCTGGGTAC TTGAGATTAG GGAGTGGTGA TGACTCTTAA CGAGCATGCT
49051 GCCTTCAAGC ATCTGTTTAA CAAAGCACAT CTTGCACCGC CCTTAATCCA
49101 TTTAACCCTG AGTGGACACA GCACATGTTT CAGAGGGCAC AGGGTTGGGG
49151 GTAAGGTCAC AGATCAANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
49951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
50501 NTACGAAAAC CAGTCAGGCG TGGCGGTGCG CCCCTGCAAT CGCAGGCACT
50551 CGGCAGGCTG AGGCAGGAGA ATCAGGCAGG GAAGTTGCAG TGAGCCGAGA
50601 TGGCAGCAGT ACAGTCCAGC TTCGGCTCGG CATCAGAGGG AGACCGTGGA
50651 AAGAGGGGAG AGGGAGAGGG AGAGGGAGAG GGAGAGGGAT CAGATTACTT
50701 TTTAAAGCCC TACTTATTTA AAAAGACATC TTCCTTTTAA CCTCCAGGCT
50751 TTTGTAAAAT GCTTATTTCT CTACTGAAAT ATCCCTTCCC TCTCTTCTCT
50801 TCTTGCAGAA CACATCTATC AGACCTCCTG GTGAAGTTTC TAGCACAGCT
50851 TTTTTCTCTT TCTCCCTTAG AATTAATAAC TGCCTCATCT GTATTTCCAC
50901 AGCATTTCCA AGTACTTCAT ACACCAGCCT GTGTCAGTTT GAAGCATTAT
50951 TAGCTATTTG CCCTGCAAAC TTGGGAAGGG TTTTTCTGCC TTGCAGTAGT
51001 ATGAAGTCTG AAATCAGGAC TATGACTTAT CTATCTTACT TATATTTGTG
51051 AAGTTGGTTG TCTGATTTGC TTGGATAGTC TGGTCATCTC AATTGTACAA
51101 TAAGTGCTCC ATTACTTTAC TTTCCTTAAA ATACAACGAT CTCAGATTCC
51151 AACCCCAATC TACTCCAGTG GGTGGGACAT TCAACCTTAG TGTGCTGTCA
51201 AGCTCTCCAG GGTCATGTCA TCTGAAAGGC CCTCTTGGCC CTGTGAAGAC
51251 TGATTAACTG TGTAGCCATG GAGTCTGGGA TCTTGAGGCA GGAACTCTAG
51301 GCTGGTGTGC AGTCTCTTGC TCACTACTCC AATGTACTGC CACAGATTAG
51351 GACTTGAGTC CGCCATCTCT TTAAAAAAAA AAACAGTTTT ATTGAGATAT
51401 AATTGATCAT AATAAACCAC ACATATTTAA TGTATATACT TTATAAAATT
51451 TGGCAGGCAC ACCCATGAAA CCCATCACCA CAATCAATAT AGTGAACATA
51501 TCCATCACCT GCAAAAGTTT GTTGCGCCCT TTTGTAAACT CCTCTCTCTT
51551 ATTCTCCCTA CCTCTCCTCC CATCTCATCC CCATGCAATC ACGGATCTGC
51601 TTTCTGTCGC TGTAGGTTGG TTTGAATTTT CTAGATTTGT TTGGATTACA
51651 TAAATGGAGT CGTGCTGTAT GTGCTC MM TTCTGGCTAC TTTCATTCAC
51701 ATAATAATGT TGAGATTTAT CTATGTTGCA CATATTAATA GTTCATTATT
51751 ATTCTTTATT GCTGAGTATA TTCTATTGTA TGAATGTATC AAAATTTATT
51801 GATCCATTCA CTGTAGATGG ATATTTGGGT TGTCTCCAGC TTTTGGCTAT
51851 TATAAATAAA GCTGCTAGGA ACATTCATAT ACAAATCTTT TTTTTTTTTT
51901 TTGAGACAAG TTTCGCCCTT GTTGCCCAGG CTGGAGTGCA ATGGCACAAT
51951 CTCAGTTCAC CACAACCTCT GCCTGCTGGA TTCAAGTGAT TCTTCTGCCT
52001 CAGCCTCCCA AGTAGCTGGG ATTACAGGCA TGCGCCACCA CGCCTGACTA
52051 ATTTTGTATT TTTAATAGAG ACAGGGTTTC ACCATGTTGG TCAGGCTGGT
52101 CTCGAACTCC CGACCTGAGG TGATCCACCC ACCTCAGCCT CCCAAAGTGC
52151 TGGGATTACA GGTGTGAGCC ACTGTGCCTG GCTATCATGT ACAAATCTTT
52201 ATGTGGTCAT GTGCTTCTTT TCTTTCTTTT GGGTAAATAC ATTGGACTGG
52251 GATGGATGGA TCATATAGTA GGTGTATATT TAACTTTCAG AGAACTACCA 
52301 AATGGTGTTC CAGAATGGTC GCACTGTGTT ACACTCCCCT TGACATTGTA 
52351 TGAGTGTTTC AGTTCTCTCT GTGCAGCTCT CTCCTCTTTG GGTCTTTGTC 
52401 TTTCAGACTC TAGCACCTTA ATACCCCCCA AGCCTTGTCT TATCAACTCA 
52451 GGGAGTTGGC CACACTCATC TTCGGTTTCC ATCCCTGCAC CTCTTCAGTT 
52501 CCCCATCCCC GCACCATGGC TTGCAAACTC TCTCAAGACA GGAGGCTGGG 
52551 GCAGTTGCAG GGCTTGTCTC ATTGGTTTTC TGTTTCTTAG GGATTACTGT
52601 CTTTCATTGC TGGATGTCTA ATGTATTAAA AACCATTTAT CTATTATATG 
52651 TTTGATTTGG CTCTTTGGTT GTTTCAGGTG CGGAATTAAA TCTGGTTTCT 
52701 GATACTCTGT CTTGGCTGAA AGCATACGTT TTCAGTGCCC ACTGCTGGAG 
52751 AGGGGTGGAG GGCACTCAAG AGTTCCATTT GGACATTGAG TTAGAGAAGT 
52801 TGTGAGAGTT TACATACCTG CTCTGGAGCC TTTACCCCAC TGTTCCCTCT 
52851 GCATGGAAAA TGCTCTCCCC AGACTGGCAT ATGCCAAGGT CCAATATCAT 
52901 TCCAGGGCTT AAATTGATTG CCAGATAAGC TTTGCCTGTA TTACTCTCAC 
52951 TCCCTACTCA TTTTCTGTCC TGTTATCCTA TTTTGTTCCC TTGATAGCAC
53001 TTAACACTTT CTGAAATTAT GTCATTCGTT AACTCATTTA TTACCTATCC 
53051 TACTCCAGTA AAATGTAATT TTCGTGTCAG CAGGGACCTT TCTGGTCATG 
53101 TCCACTGTGC TAACCCATTT TGAGGGTTTC TGGCCCCTGG GGAGTGCTCA 
53151 GTGTGAATTT GTGGAGTGAA TATTAAGATG AAGATAATGC TAAGTAGGCA 
53201 GTTGGATATG TGAGTCTGGA GCTCAGAGGA GAGGAAAAGT GAAGCCTGAA 
53251 GATACACATT TAAGAGTCTC TGCTTAACAG TGGCATTTAA ATCCATAGGA 
53301 ATGAATGAAA CCCCTTGTAT TAGGGAATAG AAGAGCAGAT GGCCCAAGAT 
53351 AGGATGCTAA GAAACCTCCG AATATGGAGT TCACATCTCA GTTGTGCCTT 
53401 TGAAATTCTT GTCATCCACT TTTAGTTTTC TTCTCTTCCT ACTTGAAATT 
53451 GCCTACCAAT TTTCAGAGCC CTCTCCTTCC TTTATACCGT CATGAGTTGC 
53501 GCACTTTGCT TATTTTCCTG ATTAAGATCA TAAGCCTCTT AAGGGAAAGA 
53551 TCCTGTAGTC AAAATTACAT TCTTGAATTG AATTGGGTTG GACTGGAGTG
53601 GACTGGAGTG ATAAGTATTG TCACATTATA GAATTCCACC CACTGAAGTG
53651 CAAGTGTTAA ATGTATTAAT ATTTCAAGTT AATGGATACT CTGCCCAAGT
53701 TTTTAGTTAA TTATTATTAA CTTTCCATTA TAAAAGCTTG TTTTTCTTAT
53751 TAAATCAATC ATCAGATTTA ACGCAGAAAT CAACTCATGT AAACATACAG
53801 TGAGAGAATT GTATTTTTCT CTAAATTTTC AGGACATTGA AAAAGACTTG
53851 AAACAAATGA GGCTTCAGAA CACAAAGGAA AGTAAAAATC CAGAACAGAA 
53901 ATATAAAGCT AAGGTAAGAA ATACTTTTGT CTTTGGGTTC CATATTAAAT 
53951 AGCTGGCTGG GGAGCCACCT TGTGATCTCG GTTGCCTGCA TGATTTTCCC 
54001 CCTAGTATTT TATAGAATTG CTCTATTTTG TGATATGAGA CCAATGGTTT 
54051 TAAGAATCTA TAATGTCAAA CAAAATTGAC CTAGGGAGTT GTAATTTTAA 
54101 GGCTTTTACT GAATTGCTAA ACTTTTTTTT TTTTTTTGCT TTCTCCTAGA
54151 AGGGGGTAAA ATTTGAAATT AATTTAGACA AATGTATTTC TGATGAAAAC 
54201 ATCCTCCAAG AGGAAGAGGT ATGCCATTAA GTCTAAATTT CCATTAGTAG 
54251 GTATCAGAAA ATGCATATAT CTTAATAGCA TGTTTCATGA AATTATTTCA
54301 CAGGCTGTAG GGATAATTTT TTTCAACTTT TATTTTAGAT TCAGGTGGTA
54351 CATGTGCAGG TTTGTTACCT GGATATGTTG TGTGATGTTG AGGTTTGGGA 
54401 TATGAATGAT CCCGTCACCC AGGTATTGAG CATAATACCC AGTAGTTAGT 
54451 TTTTCAAGCC TTGCTTCCCT CCTTTCTTAC CCCCACTGTA GTAGCTCCCA
54501 GTATCTATTG TTGCTATCTT TATGTCCATG AGTACCCAAT GTTTAGCTCC 
54551 CACTTATAAG TGAGAACATG CAGAATTTGG TTTTCTATCC CTATGTAATT
54601 GGTTTTCTAT CCCTATGTAA TTTGCTTAGG ATAGTAGCCT CCAGCTGCAT 
54651 CCATGTTGCA TGGACATGAT TTCATTCTTT TTTATGGCTG CATAGTATCC 
54701 CATGGTGTAT ATGTACCACA TTTTCTTTAT CCAGACCACC ACTGATGGGC 
54751 ACCTAGGTTG ATTCCATGAC TTTGCTATTG TGAATAGTGC TGGGATGAAC 
54801 ATGTGAGTAT ATGTGTCTTT TTGGTAGAAT GGTTTGTTTT CTTTTGGATA 
54851 TATACCCAGT AATGGGATTG CTGGGTTGAA CAGTAGTTCT AAGTTCTTTG
54901 AGAAATATCC AAACTGCTTT CTACAGTGGT TGAACTAATT TACATTACAT 
54951 TTCCGCCAAC ACTACATAAG CATTCCCTTT TCTCTGCAGC CTCGCCAATA 
55001 TTTGTTTTTT GACTTTTTAG TAATAGCCAT TCTGACTCGT GTGAGATGGT
55051 GTCTCATTGT GGTTTTGATT TGTAGTTCTC TGATAATTAG TGATGATGAG 
55101 TATTCTTTTA TATATTTGTT GGCTGCTTGT ATGTCTTCTT TTGAGAAGTG
55151 TCTCTTTCTA TCTTTTGTCC ACTTTAAAAT TTGGGTTGTT I I I I C I 1 0 1 I
55201 CAGTTAAGTT CCTTATAGAG TCTGGATATT AGACCTTTGT TGGATGCATA
55251 GTTTGCAAAT ATTTTCTTCT ATTCTGTAGG TTGTCTATTT ACTCTGTTGA
55301 TAGTTTCTTT TGCTGTGCAG AAGCTCCTTA GTTTAATTAG GTTCCACTTG
55351 TCAATTTTGT TTTTGTTGCA ATTGCTTTTG AGGACTTAAT CACAAATTCT
55401 TTCCCAAGGC CCATGTTCAT AATGGTGTTT CCTAGGTTTT CTTTTAGGAT
55451 TCTTATAGTT TAAGGTCTTA CTTTTAAATT GTTAAGTCAT CTTTAGCTGA
55501 Mill GTATA CAGTGAAAGG TAGGGGTCCA GTTTCATTCT TCTGCATGTA
55551 GCTAACCAGC TATCCCAGCA CCACTTATTG GATAGGAAGT CCTTTCCCCA
55601 TTGCTTATTT TTGTCGATTT TGTCAAAGAT TATATGGCTG TAGATGAGTG
55651 GCTTTATTTC TGGGTTCTCT ATTCTGTTCC TTGGTTTATG TGTTTGTTTT
55701 TGAACCAGTA CCATACAGTT TTGATTACTG TAGTCTTATG GTATAGTTTG
55751 AAGTTGGGTA ATGTGACGAC TCTGGCGTTG TTCTTTTTGC TTAGAATTAC
55801 TTTGGCTATT TGGGCTCTTT TTTGTTTACA TATGAATTTT AGAATAGTTT
55851 TTTTTTTCTC CAATCCTGTG AAAAGTTACA TTGGTAGTTT GACAGGAATA
55901 GTGTTGAATC TATAGATTAC TTTGGGCAGT ATGGCCATTT TAATGATATT
55951 GATTATTCCA ATCCATGCAT GTGGCATGTT TTTCCATTTG TTTATGTCAT
56001 GTATGATTTC TTTCTGTGTT GTGTAGCTCT TCTTGTAGAG ATCTTTCACC
56051 TCCTTGGTTA GATGTACTCC TAGGTATTTT ATTTTATTTT TTGGTGGCTA
56101 TTGTAAATGG GATTACGTTC TTGATTTGGC TCTCTGCTTG AATGTTATTG
56151 GTGTATAGGA ATCCTATTGA TTATTGTACT TCGATATTGT ATCCTGAAAC
56201 TTTGCTGAAG TTGTTCATCA GTTCCAGGAA CCTTTGGGTC GAGTCTTTGG
56251 GTTTTCAACC TATAGTATCA TAAGCGTGAA GAGATGGTTT GACTTCTTCT
56301 TTTCTTATTT GGATGCCTAG AATTTTAGAA AATATTTCTA GAAAAATGTT
56351 TGGTGCTCAA GGCCAGGGAA CGGTGGCTCA CAGCTGTAAT CCCAGCACTT
56401 TGGGAGGCTG AGACGGGCAG ATCATGAGAT CAGGAGATTG AGACCATCCT
56451 GGCTAACATG GTGAAACCCC ATCTCTACTA AAAATACAAA AAATTAGCTG
56501 GGTGTGGTGT CACCCACCTG TAGTCTCAGC TACTTAGGAG GCTGAGGCAG
56551 GAGAATCACT TGAACCCAGG AGGCAGAGGT TGCAGTGAGC TGAGATCGCT
56601 GTACTGCACT CGAGCCTGGG CAACAGAGTG AGACACTGTC TCAAAAAAAA
56651 AAAAAGGGAA AGAAAAATGT TTGGTGTTCA AATGAGTCCT CCAAATACTT
56701 TTTATTCTCC CATTTTATTT TATTGGTGTT ATTTCTTTAG ATAAATTATT
56751 ACATTTTAAT TTACTTTTCT TTAAATAAAA GAGCTATTTT ACTCATAATA
56801 TTAATTTTTA TCATAGCCAA ATTAAAATAG AAGACCTGAT ACATTGTCAA
56851 CAACTAATAT ACTGACCTAA AAAATTGAAC AGGTACCCTG AAACCAGGCA
56901 CATTTATTTT AGGTCTTAAT TAGTTATTGA TAACTTTAAG TAAATCTCAT
56951 TTATGCATTT GGGCTCTCCT TGCCACAGCA AGGAGTAAAT ACAGTAAATC
57001 CAATACAGTA AATCCAAATT TCATTTTATT AGTTGATTTC AAAATTTTTT
57051 TTTATCCTGG TTTTATCAGA CCTATAACAA ATGTCAAAAT TAATTGGTTT
57101 ATTTTTCCAT TTTACC Mil CTGAATTCAC CTTTTAAGTC AATATAAGTA
57151 TGAATAATTA TACCTGATGC TCAGTTTTTA TTTAATGTTC TTTATTAGCT
57201 TAAAACATTT TCATGTTAGC ATTTCTTATT TTTATGAGCA TTTGCTACAT
57251 AAAGACTTCA TTAGAGTGGT GATAGTTAGC ATTCACCTCT GTTCAACCAT
57301 AAATTCCTAA ATGCCCCAGA GGTGAGACAT CAGAGTGGAG CAGATCTGGG
57351 GACCTGCTTC TGAGTGGGAA CTTGAGAAGT GGTACTCTCA CAGAGCTTCT
57401 GTGAAGTGAG GTGCTGACGT TGCCCTGCTG AAATGAAAGA ATGGAGTCCA
57451 AAAAGTTTTA ACTGCCACTC TTTCTTATTC TTTGCTTTGA TCTGCGTGAA
57501 ACAGAAGTGT TCATTTTGGT ATTGACTACA AAATACTAGG AGCAGATTTA
57551 GGGAGCTGGT TAAGAATGTT GTACACTTAA AACAGGGCAT GAATGAGAAA
57601 AGCTTGAGAG CACTGTAGAA TGGAGCTGAA GTGGAATACT ATTGAAGTCA
57651 GAAAGTCTAG ATAAAATTAA GTTGCCTTAT GACCAGTGCT TGACACTGTT
57701 AACATGGAGA AGAAATGAAA ACATTTCTGT TTTTATCTAA CATAGCTCTA
57751 GTTTTAAAAC TCTATGGATT TATTTGTTTA GTAAACATTT GTTGAATATT
57801 TACTATATAC CTTGCTAATT AATTTTACTA GGAACACGAA AATATGGTTT
57851 TTCTTTCTTT CAAAATATGG CTAATTTATC ATGAAACACT GTGGAATTGA
57901 TTTAGGCAAT GGATATACCA AATGAAACTT TGACCTTTGA GGATGGCATG
57951 AAGTTTAAGG AATATGAATG TGTAAAGGAG CATGGAGATT ATACAGACAA
58001 AGCATTTGAA AAACTTCACT GCCCAGAAGC AGGTATGTGT TTCTTGAAAG
58051 TTGTAAATGA GAAGGAACTG TTTTATTAGC AACCCATTTT GAACTCTGTC
58101 CCCATGCATC TGCCTCGGCT CCACTGTTAC TTGACCCCTT TCTGCCCTCT
58151 CTAAGCAAGG CAGAAACACA CTTATTATTC TCCTGCCACC CATGCAGTGG
58201 CCACACTCCC TGAGATCCAG CCCTCCTCTC CTGCTCCATA CCCACTCCCT
58251 CTTGCAGCTT TGGCTTCTCC CAGGAGCTCC AGACTTACCA GTCTTTCTCA
58301 TTGTCTTCTG GGAAGCTCCA TGGACAAGTG TTGCCAGTAT CTGAAACTCA
58351 GCTGTGTAAA GTCAAGCTCT TCTGTGCTCT TCCCAGTGAC CCTTTATTTT
58401 GGTTAGTGTC ACAGATGCAA CTGGCTGGGG CCAGTGTTGT GGGCAGTAAA
58451 AGAATTTATC AACACAATTG TAAGTAAAGA AAGGCAGATT TATTAAAGTA
58501 CAGAGATACG TTGCAAGAGT GCAATGGGCA GCACAGCAGA GAAGAGGCTG
58551 TCTGCTAAGA GGCAGGGGCT AGAGGGAAGT TTTATAGGGT CATATTGGAG
58601 GAGCTACATG CTGATAAGGT GTGCAGATAA GGTTTTGCTG CTTGGGCTAC
58651 ATGTGGAAGG AATGAGGTAT TTGGGAACAG GATGTGACAG CAGCTTGTCT
58701 GTGATGAGTC ATCTCTCAGA ACAGTTGTTC CCCCATCCCC ACCCCCAACC
58751 TGGGACCCCT CCCTTTTTGT TGTTTACTTA TCTTATGAGA ACTTCACAGT
58801 CAGTGCTGTC ACCAGGGTGC ACCCTTAGCA TAGTGTCTAT TCTGAGATGT
58851 CTCTGGAGTC TTCCTCTTCC TTCATTCTCT CTGTTACTGG TTTAGGGCTC
58901 TGTCATCTCT CAGTAGTGTG GTGTAGGCTT CAGAGACAGA TGGGAATTGA
58951 ATCTCAGCTG TTGCTGCCAC CTTCTGGTTA TGTGACCTTT CTTTCACAAG
59001 TTATTCCAAC ACTGAATCTC AGTTTCACCT TAGGAACAGG GGATAATAGT
59051 AGTAGGAATA ACCACACAGG GTAATTGTGA GGACCAAAGT GAGTTTTGAT
59101 GTATAAACGA CCTGGCACAT ACTAGGTGCC TAAATTAAGT GCTGTCTTTT
59151 CATTTTCCCT TTTCCTTCCC CTTGCTGTAT TGCCTTATTT GCTTATGTGA
59201 CCTTCTTTCT CTAGTATTTC CCCTTCATTC TCTAAATGGT TACTGTATTA
59251 GTCCATTTTC TTGCTGCTGA TAAAGACATA CCTGAGACTG AGCAATTTAC
59301 AAAAGAAAGA GGTTTAATTG GACTTACAGT TCCACATGGC TGGGGAAGCC
59351 TCACAATCAT GGTGGAAGGC AAGGAAAAGC AAGTCACATT TTACATGGAT
59401 GGCAGCAGGC AAAGAGAAAG AACTTGTGCA GAGGAACTCC TCTTTTTAAA
59451 ACCATCTTAT CTCGTGAGAC TCATTCACCA TCACGAGAAC AGCATGGGAA
59501 AGATCCGCCC CCATGATTCA ACCACCTCCC TCTGGGTCAC CCCACAACAC
59551 ACAGGAATTC AAGATGAGAT TTGGGTGGGA CACAGCCAAA CGATATCAGT
59601 TACTAAAGTT ATCTTGGCAT ATTATTACTC TGCTCAGATT TTTTTTTTGG
59651 ATAATACCTG CAGAATAAGG TCCATTCCAC ATATTATCAC ATTTAACACT
59701 ACATGGCCTA ATTCTGCTGT GACCCACTTT TCTCATCCCA GCATGGCCTC
59751 TTTCCTTCCA TGGAAAATGG GATCCATACA GCCTGCTGGA ATGCCCATTT
59801 TCTCCTACAG CTGGAATGCC CATTTTCTCC TACAGCATTT ACAGAACTGA
59851 CTTGGCTCAG TTTCCTCTTC CTGGAATACT CTCTGCCTCA TTTCCTTCTG
59901 GAAAAATCTC CATTCAGCAG GCATCTTATT GAGGATCTCC TTTGTGCCAA
59951 AGACTGCTCA CTGGTAGGGA GCTCAAAGAT GAATGAAATC TGGGCCCTGT
60001 TCTCAATATC ACAGAAGTGT TATGAGCAAA AAAGTCACAA AACATGTTTT
60051 CTGAGCCTGA AATGTTAATC ACTGTTTGAA GTGCGAGCTG GGTGGAGAGT
60101 CAGGGAGGTC CGCACTCCTC CAGGGCTTCA CATGCCATCA TTTTTGTGAT
60151 TGAGAAGGAT CATGCTGGCT GCAGAGCAAA GGATGGCATG GAGGGCAAGA
60201 CTGAAGGCAG GAGAAGAGTC CAAGTGCATG AGCCAGAGTG GTGCAGGGAG
60251 AATAGATACT GAGTGTGGGA ACTGAGGAAG AGAAGGGGCT CAAGGATATT
60301 CCCAGTTTTC TAATTCAAAT GCATGAAGCT TTCATCAACC AAAAATACAT
60351 CACATGGAGG GTAATGGGGT CGGGAGAGAC AAGGTAGTGA TCTAAATTTG
60401 GAACATGTTG AGATTTAGGT CTATAGAGCA TCAGTTGCAG ATTCTATATA
60451 AGACTGAAGG CCTGGGGCAT ATCAGGGATA AAGATATAGC TTGGTGGCCC
60501 TTAGCATATC CGTGGTTTTT AACTTTGGTG ATGGTCAAAA TACCTATGCA
60551 GAAGGACTGG AGTGAGAAGG AAATGGAGCT TAGGACATAA CCCTACCACT
60601 ATATAAACAA ACTTTGGAGA ATCAGGAGAG AGTAAAGCCA AAGGAGGAGA
60651 GACAGGTCAT GGAGGAGGCA CAGGAATTGG CAGCATCAAC TGGAAGAGAA
60701 AGGCCAGATG AGGTGAGTGG GATTTGGCCC TTCAGGAGCC GTTAATGGCC
60751 TCAGGGAAAG CAGTCAACTG TGTAAGGGGT AAATTCAATG GTTATCTTTG
60801 CATCAGTTTG CTGGGAAAAG CAGAGGGGGT TGGCTGTTTT TTAGATGAAA
60851 GAAAAAAAAA CCTTCATCAG TAGTATACTG AAAATTGTCT CTCATTTTAA
60901 TCTGTATTCC TGTAATTATT ATTTAGGCTG AAGGATTTTT CCGTATGTTT
60951 GTTGACCATT CATATTTCTC CI I I I I I I I I CI I I I I I CTT MM I CI I I I 
61001 Mill IGAGA GGGAGTCTCC CTCTGTCGCC CAGGCTGGAG TGCAGTGGCA 
61051 CAATCTTGGC TCACTGTAAC CTCCGCCTCC TGGGTTCAAG CGATTCTCCT 
61101 GCCTCAGCCT CCCTGAGTAG CTGGCATAGG TGCGCGCCAC CACGCCTAGC 
61151 TGATTTTTAA AATATTTTTA GTAGAGATGA GGTTTCACCA TGTTTGCCAG 
61201 GCTGGTATTT GAACTCTTGA TCTCAGGTGA TCTGCCCACC TTGGCCTCCC 
61251 AAAGTGCTGG GATTACAGGC ATGAGCCACC ACGCCTGGCC AACCCTTCAT 
61301 ATTTCTGTTA TGAATTATGT ACTCATGCCC TTCATCCTTT TTTCTACTGA 
61351 AAATGGCATG TTTGTTTTTT TCTTTATAAG ACTGATTTAA ATCAAACCTT
61401 TGCCTGTAAT ATGTATTGCA AATGTTTTCC TCAGTTGGTT GTCAGATCTC 
61451 ATTTATAGTA ATAACAGCAA ATATATATGA GTGTGTGTGT GTGTGTGTGT 
61501 GTGTGTGTGT GTGTATTTGT GTATTCATCC ACTTAGGAAT AAATTTTATG 
61551 AGAATTGTGC GGCATATAGA AAGAAAACTG TAAAACCTTA CTGAGGTATT 
61601 TACAGACCAC TTGAATAAAT GGAGAGAAAT AACGGTGCTA TATATTGGAA 
61651 ATATTTTTTC CAAATAAATA TTGCAGTATC GTTGTCTGAG GTATTACCCA
61701 GAACTCTTTG TCTCACGACC AAAAGAATAA GGAGGGTGGA CAGTAAGGGT 
61751 GAGTTTGGAC CGAAAATTTA ATAAACAAAA GAGGAAAGCT CTTCACTGTG 
61801 GAGAGGGGAC CCAAGAGGGT TGCCATTTCA CAGCTGAGTA CAAAGGCTTT 
61851 TATGAGGAAC CTGATAGGGC TGGGGGTTTC ATTTGCATAA GGCATGAATT 
61901 TCTGGCAGCT CCACCCTGTT CTCCTAGTAT GCTGACTGGC TAGGGGTTGT
61951 TTTTGGAAAA GGCACCACTC AGAAAATGAC ATGATGGTTG ACCAGGCATG 
62001 GTAGTTCATG CCTGTAATCC CAGCACTTTG GGAGGCTGAG GTGGGCAGAT 
62051 CTCTCAAGGC CAGGAGTTCG AGACTAGCCT GGCCAATATG GCTAAAGCCC
62101 ATCTCTACTA AAAATACAAA AATTAGCCAG GTGTGGTGGT GCACACCTGT 
62151 AATCTCAGCT ACTTGGGAGG CTGAGCCACA AGAATCACTT GGACCTGGGA 
62201 GGTGAAGGTT GCAGTGAGAC GAGATTGTGC CACCACACTC CAGTCACACT 
62251 CCAGCTGGGT GACAGAGCAA GCAAGACTCC ATCTCAAAAA AAAAAAAAAA 
62301 AAAAATGACG TGGTGTAAAG ACCAGTTGGA GGCTTGGCCC ACAACCAGCT 
62351 GAGTGTTGGA GTGATGGTTC ACAGAGGCTT GGCTCACAGT CCAAAGTATG 
62401 CCCCAAAAAG GAAAGGAATG TGCTCACTGG GGCCCACCAT GTACATGCCC 
62451 ACAAAAGGAG AAGGAACTAT TTGCTAGAGG CCCACTGATT GCACAAAGAA 
62501 CAAAGGCATT TCTGTGTTGG ACTTTGCTCC CTTATCTGTG CAGCTGTGGG
62551 CATGTTTTAG GCAAGCTTCC TGTGCTAGTT CCCTTATCTG TGTCTGCAGC 
62601 TTGATTTTTC AGACTGTTCT TTTGTTTGAA AGAATTCTGA GGACCTGCCC 
62651 TAACTGCCTG CCTAACTGAT TCTTTCTTTC TCCTCCCTCA ATATGTGGAT 
62701 TTATGGCTAT TTCAATCAAA ACCACAGTAG GA I I I M I II TTAATGGTAT 
62751 AGGGAGATCT TGGCAGGTTG GAGAATCCTG GAGCTTCTTA AGTGGCCAAA 
62801 AATTTTGAAA AAGAAGAACA GTGAAGTGGT ACTTACATTT CCAAATGTCA 
62851 AAATATATTA CAGAAATTAT AGTCATTCAC ACAATATGAT AGTAGCACCC 
62901 AAATAGTTAA AACAGTGAGA AGAGAAAGTT AGAAACAGAT CCTAGTATGT 
62951 ATCATAATTC AGCACAAATG AAAAGTAACA TCACAAGTCA GCGTGAAAAG 
63001 AAAGGATTAT TCAGATAAAT GCTGCTGGGC CAATTGGTTA ACAGTTTGGG 
63051 GAAGATTGTG AAATCAGACC CTATATAATA TGATACAACA AAATAAATTT 
63101 TTTAAAAAAG AGTTATATGT AAAAAGTTAT ACATTAGAAA ATGAAATAAA 
63151 AGAACATAGG TCA II II M I II II I I II GA GACAGCGTCT CACTCTGTCA 
63201 CCAAGGCTGG AGTGCAAAGG CGTGATCTCG GCTCACTGCA AACTCCGCCT 
63251 TCTGGGTTCA AGCGATTCTC CTGCCTCAGC CTCCCGAGTA GCTGGGACTA 
63301 CAGGCACCCG CTACCACGCC CAGCTAATTT TTATA Mill GATAGAGACG 
63351 GGGTTTCACC ATGTTGGCCA GGATGGTTTC GATCTCTTGA CCTTGTGATC 
63401 CGCCCGCCTC GGCCTCCCAA AGTGCTGAGA TTACAGGCGT GAGCCACTGC 
63451 ACCCGGCCGA GTTAA Mill TTTGAACAGG GAAGAGCTAT CTGTTCAAAA 
63501 TACATAGAAA AAAAAACCAC AGAATAAATT AGTAATAATT CAACTTTAAC 
63551 AACAAAAAGC TGTAATAAAG CAAATCATAC TAACCCCT (SEQ ID NO: 3) 
FEATURES

Start:    2012
Exon:     2012-2128
Intron:   2129-10078
Exon:     10079-10175
Intron:   10176-17356
Exon:     17357-17454
Intron:   17455-19144
Exon:     19145-19228
Intron:   19229-19311
Exon:     19312-19382
Intron:   19383-21317
Exon:     21318-21404
Intron:   21405-25173
Exon:     25174-25228
Intron:   25229-27433
Exon:     27434-27634
Intron:   27635-27711
Exon:     27712-27793
Intron:   27794-29849
Exon:     29850-29983
Intron:   29984-36496
Exon:     36497-36678
Intron:   36679-40418
Exon:     40419-40485
Intron:   40486-42286
Exon:     42287-42403
Intron:   42404-43377
Exon:     43378-43464
Intron:   43465-53832
Exon:     53833-53913
Intron:   53914-54149
Exon:     54150-54218
Intron:   54219-57905
Exon:     57906-58032
Intron:   58033-59835
Exon:     59836-59963
Intron:   59964-60551
Exon:     60552-60587
Stop:     60588

Chromosome map

Chromosome 13
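
The exon coordinates in the FEATURES table can be checked mechanically. The following is a minimal Python sketch, not part of the original listing: the names `EXONS`, `exon_lengths` and `introns_between` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, as the table suggests. It sums the exon lengths and derives the intron intervals from the gaps between consecutive exons.

```python
# Exon coordinates copied from the FEATURES table (assumed 1-based, inclusive).
EXONS = [
    (2012, 2128), (10079, 10175), (17357, 17454), (19145, 19228),
    (19312, 19382), (21318, 21404), (25174, 25228), (27434, 27634),
    (27712, 27793), (29850, 29983), (36497, 36678), (40419, 40485),
    (42287, 42403), (43378, 43464), (53833, 53913), (54150, 54218),
    (57906, 58032), (59836, 59963), (60552, 60587),
]


def exon_lengths(exons):
    """Length in bases of each exon interval."""
    return [end - start + 1 for start, end in exons]


def introns_between(exons):
    """Intervals lying between consecutive exons."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


coding_length = sum(exon_lengths(EXONS))
introns = introns_between(EXONS)

# Number of exons, total exonic length, and whether that length is a
# whole number of codons.
print(len(EXONS), coding_length, coding_length % 3 == 0)
print(introns[0], introns[-1])
```

Under these assumptions the derived intron intervals can be compared line by line with the Intron rows of the table, which is a quick way to catch a transcription error in either set of coordinates.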
Allelic variants (SNPs):

DNA                                          Protein
Position  Major  Minor  Domain               Position  Major  Minor
1848      A      G      Beyond ORF(5')
1993      A      C      Beyond ORF(5')
3796      C      T      Intron
4854      C      G      Intron
7649      A      G      Intron
8491      G      A      Intron
8928      T      C      Intron
18844     A      T      Intron
19055     G      A      Intron
19607     G      C      Intron
19935     G      A      Intron
20595     T      G      Intron
22513     C      G, A   Intron
25121     T      C      Intron
25883     A      G      Intron
26375     A      T      Intron
26693     C      T      Intron
30477     T      G      Intron
30858     A      C      Intron
31028     A      G      Intron
37308     -      T, G   Intron
44213     G      A      Intron
54046     G      A      Intron
56598     A      G      Intron
56759     A      T      Intron
58510     A      G      Intron
60356     G      A      Intron
61643     C      T      Beyond ORF(3')
62221     G      A      Beyond ORF(3')
62306     A      -, T   Beyond ORF(3')
63224     G      T      Beyond ORF(3')
63350     G      A      Beyond ORF(3')
63517     C      A      Beyond ORF(3')
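
The Domain column follows from where each SNP position falls relative to the exon intervals in the FEATURES table. Below is a minimal Python sketch of that lookup, not part of the original listing: `classify` and `EXONS` are hypothetical names, coordinates are assumed 1-based and inclusive, and, as a simplifying assumption, every position after the last exon (including the stop codon at 60588) is reported as Beyond ORF(3').

```python
from bisect import bisect_right

# Exon coordinates copied from the FEATURES table (assumed 1-based, inclusive).
EXONS = [
    (2012, 2128), (10079, 10175), (17357, 17454), (19145, 19228),
    (19312, 19382), (21318, 21404), (25174, 25228), (27434, 27634),
    (27712, 27793), (29850, 29983), (36497, 36678), (40419, 40485),
    (42287, 42403), (43378, 43464), (53833, 53913), (54150, 54218),
    (57906, 58032), (59836, 59963), (60552, 60587),
]
EXON_STARTS = [start for start, _ in EXONS]


def classify(position: int) -> str:
    """Return the domain label for a genomic position."""
    if position < EXONS[0][0]:
        return "Beyond ORF(5')"
    if position > EXONS[-1][1]:
        # Assumption: the stop codon is lumped in with the 3' region.
        return "Beyond ORF(3')"
    # Index of the last exon whose start is <= position.
    index = bisect_right(EXON_STARTS, position) - 1
    _, end = EXONS[index]
    return "Exon" if position <= end else "Intron"


for snp_position in (1848, 3796, 60356, 61643):
    print(snp_position, classify(snp_position))
```

A binary search over the sorted exon starts keeps the lookup short; with only 19 exons a linear scan would serve equally well.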



Context:

DNA
Position

1848 TTTCCAGAATTCCCCCTAGGGGGCAGTACGTCCCCACTAAGAAAGGCTGAACTATAAAAG
TGCA(^GCCTAAGGACATTCCTGCTTTATAAAGGTGCGAAA(^CCGGATATAGTATCTT
TCATTCTCAGAACAAACTTGCAAAACAGGTATTGTTATTCCATTTTAGAAATTAGGAAAG
TGAGGTTTTGCCAGGTTAAGTGACTTACCCGAGAATACAGGGCAAAAGTGTATCAAAGCT
GAGCTATGACCCGTGTCTGACCAAGAAACTCTGTCTCATTTCAGTTATCTGTGGCCACAA
[A,G]

GAAAGTTATTTGTCTCTGTCTTGGCAAGGCTGGGAGGAAAGTTTTAGCTAAGTGAGTTCT
TTTACACTTTAGTCATCAGTTTTCTGACTTTGTTAGTCTTTATGAGACGTGTGTGATAAA
TTTACATTACTCTAATTCCAGGAAACTCAGCCCATTGGAGACCATGGATAAGTACGATGT
GATTAAGGCCATCGGGCAAGGTGCCTTCGGGAAAGCATACTTAGCTAAAGGGAAATCAGA
TAGCAAGCACTGTGTCATAAAAGAGATCAATTTTGAAAAGGTAAAGTTAAGTTCAAATTT

1993 CAGGTATTGTTATTCCATTTTAGAAATTAGGAAAGTGAGGTTTTGCCAGGTTAAGTGACT
TACCCGAGAATACAGGGCAAAAGTGTATCAAAGCTGAGCTATGACCCGTGTCTGACCAAG
AAACTCTGTCTCATTTCAGTTATCTGTGGCCACAAAGAAAGTTATTTGTCTCTGTCTTGG
CAAGGCTGGGAGGAAAGTTTTAGCTAAGTGAGTTCTTTTACACTTTAGTCATCAGTTTTC
TGACTTTGTTAGTCTTTATGAGACGTGTGTGATAAATTTACATTACTCTAATTCCAGGAA
[A,C]
CTCAGCCCATTGGAGACCATGGATAAGTACGATGTGATTAAGGCCATCGGGCAAGGTGCC
TTCGGGAAAGCATACTTAGCTAAAGGGAAATCAGATAGCAAGCACTGTGTCATAAAAGAG 
ATCAATTTTGAAAAGGTAAAGTTAAGTTCAAATTTCTGTTAATTTTCAGTGGGATATTCA 
GCTGGCTTTTAATCCAATATAAAAAGGAAATTTTTATTTTTTATAATTTCGAATTTTAAG
CCATAATTGATTTTTGTTAATTCAACCTCCTAAGTCCATTGTCCAAACAGCAACCAATGA

3796 TTACCACAAAAAATTGCAAGAAAACCTACCCAAACTTAAAGCTCAAGAGTAGATGACTGG 
CTTCCAGGGATAATGATTTATTTCCCAATATAGGTCTCTTTTTGTGAATCCATGGCATAT
TCATAATAATGTCCTCTTATTCTAGTGGCCCGCAATAGCTTCCTCCCATGACATTATTCT 
GCTCACTCTCTTTTGTTTATCTGACTGCTCTCCCTCAGGCTTATCTCTGTCTTCGCCCTG
TGTATGTCCTCAACCATGTGTCU I I ICTGAI I I Id I I I I CTGTCCATTGTCACCTAAA 
[C,T]

TGCCCCACTTCAGTGTTTACCAATAAGTAGATCTCTCTTAAATCTCTGTCTCTACCCCTG 
GCATCTTTCAGTACCCTAGTTCTGCATTTCTTCTGCCAGCTAGATAACTTCAGGTAATAT 
CTGTGGTTTTGTTTTGAGGTGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGC 
CATCTCGGCTCACTGCAAGCTCTGCCTCCCAGGTTCATGCCATTCTCCTGCCTCAGCCTC 
CCGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCTGGCTAATTTTTTGTATTTTTAG

4854 CTTGAATTTGCTTCAGTCL I I I I GTCGAGGCCCTGGGTCACTTGGATCCTTCAAGTGGCT 

CCAGCCCAATTTTGATAATGCTCCAGCCATGCCCCCAAACCTTCACTGGGACAGAGGCTG 
TAAAGAAAGAGTTGCCTAGGTTTGACTACATAAAAATAGAAAACGTTTGTATGTCAAAAC 
AAACACTATAAATAAATTCAAAGAAATCGAGAAGGTGCCAAAAATATTTGCAAGTATTGA 
CTTAATGGTGTTAGCCTTTTATTAAATCAATAAAAAGATAAAATCCATATATGAAGTCAT 
[C,G] 

GTACAAAAATTTGAAACTCAGTAGAAAACTAAGAAATTAGGAGTTTATTCAAAGAAAAAC 
CCCACAGATAAACAGTTAGAAAACAAATGTCCAACAGTAGGTAATTTGTTAAGTAATTTA 
TAAAAAACTAAGTGGCTATTAGCAATCATGTTGTAGGTGAAGCATTGACATGGGAAAATT
TCAATGTTTGCAATGTTTGAGAAAATAGTAAGTGTAAAATAATATAATCTTTGGAAAAAT 
ATATATATTCTCCATATATATGTATACCTACAAATATGTTCATATATGTACAAAGAAAGA 

7649 CATGATACTACATTGCCTCCCTTTATTTTTAATGACTGCATAGCATTAAAGTGGTAGCAG
GTCAAAAATACCATAATTTAGCTGGGCATGGTGGCAAGTGCCTGTAGTCCCAGCTATTCT 
GGAGGATGAGTTGGGAGGATCCCTTGACCCCAGGAGTTAAAATCCAGCTTAGACAACATA 
GCAGAACTCTGTCTTAAAAAAAAAAAAAAAGCTAGCAAAACACCCCTGTAATTTATTTAA
CTCI I M I CTATTTTCAGATAATTACATTGTTTGGTTGG I I I I I I GGCTACGATTCAATA 
[A,G] 

CATTTAATATGTAAAGTATGATTCATTTTTATTAAACAAAACTATGTATATATGCTTGCC
TATATATGCATGAAATAAAAAGCTCTAACTATTAACAACAGTTATCCCTAGGGAATATAG 
TATTAGGTTGGCGCAAAAGTAATTGCATTTTTGCCATTAAGAGTAAGGTTACCACCTATG
GGCTTTCGTCTGTGGGCTAGATGAGAAAGAAAGAGGGAAGTTTCACTTTTACCTTATTCA 
CTTCTATTTGACTTAAAACAAGCGTGCATTATTAGAGTAACTTAAAAACTAGCAATAAAA

8491 GCCTGTGGTCTCAGCTACTCAGGAGGCTGAGGTGGGAGGATCACTTGAGCTCAGTGGGCA 
GAGGCTGCAGTGAACCAAGATCATGCCACTGCACTCCAGCCTGGGTGATAGAGCAAGACC 

TTGTCTTAAAAAAAAAAAAAAAAAGATTTCTTCAGCAGGATACAGACCCCCCACAAAAAT
GAACATTTTAAAGATTCATATTATATATTGTAAAACTGCCTTCCCAGAAATATTTTATCA
ATTTGTGTAGTTTTACCAGAAATAAATGAGTGTCCATTTTGCTGCTTTCTGGCCAATAGT
[G,A] 

GTTATTGACATTCTTTTCATCTTTGCCAGTTTCATACATGGAATACTATATTACATTTTG 
TTTTAGCTTTTATTCCTTTTTTTTTTTTTTTGCAATGGAGTCTTACTCTGTCACCCAGGC
TGGAGTGCAGTGGTGTGATTTTGGCTTACTGCAGCCTCCATCTCCCAGGTTCAAGGGATT 
CTCCTGCCTCAGCCTCCTGAGTAGCTGAGACCACAGGTGTGTGCCACCACGCCTGGCTAA
TTTTTTGTGTTTTTAGTAGAGACAGGGTTTTGCTATGTTGGCCAGGCTGGTCTTGAACTC

8928 GATTTTGGCTTACTGCAGCCTCCATCTCCCAGGTTCAAGGGATTCTCCTGCCTCAGCCTC
CTGAGTAGCTGAGACCACAGGTGTGTGCCACCACGCCTGGCTAATTTTTTGTGTTTTTAG
TAGAGACAGGGTTTTGCTATGTTGGCCAGGCTGGTCTTGAACTCCTGGCCTCAAGTGATC 
TGCCTGCCTTGGCTTCCCAAAGAGCTGGGATTACAGGCATGAGCTACCACACCCAGCCAA 
ATTTTG CTTTAGTTTTTATTCCTTTG ATTACTG C ATG AG ATTG AAT ATTTTTTCT ATCAG 

[T,C] 

CAI I I I I ATTTCTC I I I I I II I I CGAGTTGACTATTCTTGTACTTTGCTA I I I I ICTGTT 
GGGGTGTTTGCC I I I I I AAAAATTATTTGCCATCAA I I I I I ATATTATAAATATATTTGT 
CATAT ATGGTACAAATATTGTATCTTATCCTTTTGTTTGTCTTTTAATTTTGTTT AT AAT 
ATTCI I I I AAATAAATAGTAGTTAGGAA I I I I I I AAGTTGCTAAATGTATCCAGCTGGTA 
GG AGTAATTTAG CTGTITTTGTTTTG AAACTCCTATGTACT G ACTAT ACAATTT AAATTG 

18844 GAACACATTTTCATAAAGAGGCTCATGGTCAACAAAGATAAAATCAAATCATGACTTAGA 
AATAAAACTAAACTTCAAAGGTAAAGTATTTGTTGGTTTTATATTAGATATACTGATATT 
TTATTACAATTCCTAACCTCACAGATCCCCCATTTCTTCCTCTTTCTCTCCCCACCCTTG
TCACCCTCCTTCCACTGTAAAGGAAGAACCAATGGCTCCCAGGTTATCAGGAAACAGGGC 
TGCTTGTGTACTATTCACGATGCAGTTAGCACCCCAGGGTTAAGTAGGAAAAAAAGAAAA

[A,T] 

CATGAACGGCATGCCTCTTTCCCTTGCTTCTACTTATCTTTTTCTGCATGTGGAATTTCC
CTTGATTTTACCAGTGATATTTGGATTACTTTTCTGTGCCTCCATTTTTTTAGTTGTAGA
ATGAAAATAATAATATGATAAAGTGTACCTATTAACTTCATTCCTATAAATACACATACA 
CTATATGTGTGTATGTGTGTGTGTGTGTGTGTATAATTTCTATCTTTTTGCATGTTACCA
TGAAGACATTTCAGTGACTACCAGGCTATTCAGTGGCTTTGTTTTGTGTTCTCTCTATAG

19055 ATGGCTCCCAGGTTATCAGGAAACAGGGCTGCTTGTGTACTATTCACGATGCAGTTAGCA
CCCCAGGGTTAAGTAGGAAAAAAAGAAAAACATGAACGGCATGCCTCTTTCCCTTGCTTC 
TACTTATCTTTTTCTGCATGTGGAATTTCCCTTGATTTTACCAGTGATATTTGGATTACT
TTTCTGTGCCTCCATTTTTTTAGTTGTAGAATGAAAATAATAATATGATAAAGTGTACCT
ATTAACTTCATTCCTATAAATACACATACACTATATGTGTGTATGTGTGTGTGTGTGTGT 
[G,A]

TATAATTTCTATCTTTTTGCATGTTACCATGAAGACATTTCAGTGACTACCAGGCTATTC
AGTGGCTTTGTTTTGTGTTCTCTCTATAGATCCTCGGTTGGTTTGTACAGATTTCTCTAG
GACTAAAACATATTCATGACAGGAAGATATTACACAGGGACATAAAAGCTCAGGTAACAG 
CTCAGAGAGAAGACTAAGACAGAACTGATCTTTTCTTGAAGTACCTCAAACAACATGACA
TTTTCTCCATTTATAGAACATTTTTCTTAGCAAGAACGGAATGGTGGCAAAGCTTGGGGA

19607 TATAGAACATTTTTCTTAGCAAGAACGGAATGGTGGCAAAGCTTGGGGACTTTGGTATAG

CAAGAGTCCTGAATAAGTAAGTACTTTGAAAATAATTTTTCTTTCTAGTCAAAATAGCCC
AAATATGTATTTTTAGATATCATGGATTAAGAAGATATTAAAATCTTGGTTGTCTAAATA 
ATTTTAGGTAGCTTTATGTAAATGCATTACATCAGATGGTACTTTGAGATTAAAATTCTC
AAGATAAATTGTGGTGTAATAGAATGATGTTGCTAATATTCTGTAGTGTGATTCCAGTTT
[G,C] 

TCAAATATGGATGTGACTGTAATATGCATAAAGCTAGAGAGAATTTCGTGAAATAGGCAG 
GTTTACACTTCTTAATGAAAAAAGTCAAACTCTATAAAATATTTGAAGAGATTTATTCTG
AGCCAAATACGAGTGACCAAAGGTCCATGCCTGTGACATAGCCCTCAGGAGATCCTAAGA 
ACATGTACCCAAGGTGGCCGGTCTACAACCTGGTTTTGTACATTTTAGGGAGATGCAAGA 
CATCAATTAGATGTACATGGGTTTGGTCCAGAAAAGCAGGACAACTCAAAGCTGGGAAGA 

19935 ATAAAGCTAGAGAGAATTTCGTGAAATAGGCAGGTTTACACTTCTTAATGAAAAAAGTCA
AACTCTATAAAATATTTGAAGAGATTTATTCTGAGCCAAATACGAGTGACCAAAGGTCCA 
TGCCTGTGACATAGCCCTCAGGAGATCCTAAGAACATGTACCCAAGGTGGCCGGTCTACA
ACCTGGTTTTGTACATTTTAGGGAGATGCAAGACATCAATTAGATGTACATGGGTTTGGT 
CCAGAAAAGCAGGACAACTCAAAGCTGGGAAGAATGGGAGGGAGCTTCCAGGTCATAGGT 
[G,A] 

GATTAAAAACTTTTCTGATTGGCAATTGATTGAAAGAGTCTATCTGAAGACCTGGAATTA 
GTGGAAGGGAGTGTCTGGGTTAAGATAAGGGGTTGTGGAAATGAAGGTTTTTATTATGCA
GATGAAATCTCCAAGTAGCAGGCCTCAGAGAGAATAGATTGTAAATATTTCCTCTTATCG 
GATTTAAAAAGGTGCCAGACTCTTAGTTAACTTTTTCCTGGATCAGGAAAAAGCCTTGGA
AAAAGAAGGGAATTTTCTTCAGAATGTAGATTTTCCCCACAAGAGATACCTTTGCAGGAC

20595 GCCTGCTATCTGTCATGTGATGTTATACTAGAGTCAGGCTGGACTTTGGTATCTTATTGC 
TACAAGGAGTCTGCTTTGTCAGTCTTAAGGTCTGTTTTAATGTTAATGCTGGTCAACTGT 
GCCTGAATTCCAAAGGGGAGGAGGAGTTAATGAGGCATATCAGACCCTGCTTCCCATCAT 
GGCCTGAACTAGTTTTTCAGGTTAACTTTGGAATGTCCTTGGCCAAAGGGAGGGTTTATG
AG TTG GTT G G G G G G CTT AG AATTTT ATTTTTG GTTT ACA CA CTTTCT AG CAAAAT AAATT 

[T,G] 

GTGCACCTGTTTGGAAGACAATTTGGTGGCAATATGTACCAAGAGATTTTTAAATATCCT
GTTTCTGGGACTTCTTCCAAGGGAATAATTTGAAATTTGGAATAACGTAAATGCCTAAAT 
AATTGGGAAATGGTTAAATTTAATAAAGCTTGGCATGGCCATGGCCATGTACCTGAATAT 
ATCATAAACATTTATGGTTTTGAAGACTTCTTGATAACTTTGTTATACTAAGCAAAGAAA 
ATGGAATTCTGAATTTTAAATACATTGTGATCACGGTTATATGAAAAATATGTGTGGAAA 

22513 AGCGATTCTCCTGCCTCAGCCTCCTGAGTATCTGGGATTACAGGTGGCTGCCACCATGCC
CAGCTAAAGTTTTGTGTTTTTAGTAGAGATGGAGTTTCACCATGTTGTCCAGGCTGGTCT
CAAACTCCTGACCTCACGATCTGCCTGCCTCAGCCTCCCAAAGTGTTGAGATTACAGGTG 
TGAGCCACTGTGCCTGGCTGGGCACTGGCTTTTTAAGCTTTTAATATTTAACTTCACCAC
TCAGTTAGTATAGAAACAGTTGTGATGGAGGCCTGCATTGGTAAGACCTGGCCTGCCACA 
[C,G,A] 

AATGGGGATCCCAGTGACTATCTCTGAGCAGTGTTACCTGAAGGTTTCAAACTTGTTTAG 
AAGAAAGCCATTTCTCTTCATTTAAAGATACAAGTGGTATAAAAAATAACATCGAAAATT 
GCAGTCACTGTGATGTCCATTTTTGTATTATATGTTCATATCTTTGAAGCACTGTTTAGT
CTATTGCAAGAAAGATTGAAGAGGATGAAGTAGAAGACAATGTGGTCTGGTGACCGCTCA 
CTGGATTAGGAGCTAGGAATCCTAGTCTTGGCTCAGTTGCTAACTTGACCAAGTCAGTTG 

25121 CTTTTGAGTCAAAACAAATCCATGTTTGAGTTCTGCCTGCCTCCTCCAAATTGCTCAACA 
TTTCATCATACATACATTGTTTTTGAGCAGGAAGCTGAACTAAATATTAAGCCACCAGGT
TGTAGCAAAGTTTGTGTGCCTTTCTTTGACTAGAAATCTGACAAACTACAAATGGTTTTC
ATTTTACCTCTTATCTTCTAATAAGAATTGATGATATATCTGAAAGCATTTGTAAAAGCT
GATCAACTTACATAAAATTGTAAAGCGACACAAATTTAAGGCACTGTAAGGATAAAAGCT
[T,C] 

TTATTAAGAATTATGGATATTTTCTTGGCATGTAAACTCTTATCTTCTTTAGGGATATTT
GGTCTCTTGGCTGTGTCTTATATGAGCTCTGCACACTTAAACATCCTGTAAGTATGCTCA
TTGTCAGACTAATCTTGAATTATTGGAATTGTAGAAAAGAAATTAACTTCTGGGAGAAAA
AGGTTAATGTTTGGTTTTATTAGATTGTTAAAAATTATATGGATAAGCTACTTAAAATAA 
TGATAGATGACATGGAAAGCTGTCCAAGCAATATTATAAAGTAAAAAGTCCAAGTTGGAG

25883 TACTAAAAATACAAAAATTAGCCAGGCATGGTGGCATGTGCCTGCAGTCCCAGCTAGTTG

GGAGGCTGAGGCACGAGAATTGCTAGAACCCAGGAGGCAGAGGCTGCAGTCAGCTGAGAT 
TGCGCCACTGCACTCCAGCCTGGGTGACAGCGAGACTCCATCTAAAAAAAATAATTAATT
AATTAATTACTGTATGAATAGATACGTTCAGCAAAAGAAAAATGTACATGGGCAAAGTTC 
ATAGGAAACCAGGCACAAGCTTTTAAGAGTCTTTTCCCAGAGGTCACATGGGATGTGCCA
[A,G] 

ATCCTCCAGCATTGTTACCCACGTCACCTGTGAAATGTGATCTATAAGAAAGCTCATCGG 
ATATACCCAGTGCCCAGGATTTTTACTGGGGACTGGTCACATAGGCACCCTCTACCTGGC
ATATGCCAAACTTCCAGACTCCTGGAAAGAAAGCCCGTGTTCAGCATAAACCATTTTGTT
CACATAAATAGCTGAGGCAAAGATAGCCACTCTTGACATTCAGGGAATGGTGGGAATTCT 
TCTGAAATCTTAGTTCCCAGACACCAGCCACGGGCCAACATTGTAAGCAGGCCTTTCTGA 

26375 CTGAGGCAAAGATAGCCACTCTTGACATTCAGGGAATGGTGGGAATTCTTCTGAAATCTT 
AGTTCCCAGACACCAGCCACGGGCCAACATTGTAAGCAGGCCTTTCTGAGGAGAGCTTGC 
TACATCAACTCTTTTCTCCACAGCTGTCATCATTGTTATTAATTATTGTCAAGGGTTGCA
CAGCCAGTGTCTGACCAAAATGTGTACTCCATTGTTTTTTTGAGATGGAGTCCCGCTCTG
TTGCCCAGACTGGAGTGCGGTGGCACGATCTCAGCTCACTGCAACCTCTGACTCCTGGGT 
[A,T] 

CAAGCAATTCTCTTGCCTCAGCCTCCCGAGGAGCTGGGATTACAGGCACCCACCACCACA
CCCGGCTAATTTTTTTGTATTTTTAGTAGAGTCAGGGTTTTGCCATGTTGGCCAGGTTGG
TCTTGAACTCCTGACCTTGGGTGATCTGCCCACCTTGGCCTCCCAGAGTGCTGGGATTAC 
AGGCGTGAGCCACCATGCCCGGCCAATGTGTACCTTTATTGCTACACCATGGAGTTGAAT 
ATTATTATGTATAAATAACTATTGGTTTCATACAATAGAAGATTTCTGGTCTATGAAGCA

26693 TCAGCCTCCCGAGGAGCTGGGATTACAGGCACCCACCACCACACCCGGCTAATTTTTTTG
TATTTTTAGTAGAGTCAGGGTTTTGCCATGTTGGCCAGGTTGGTCTTGAACTCCTGACCT
TGGGTGATCTGCCCACCTTGGCCTCCCAGAGTGCTGGGATTACAGGCGTGAGCCACCATG 
CCCGGCCAATGTGTACCTTTATTGCTACACCATGGAGTTGAATATTATTATGTATAAATA 
ACTATTGGTTTCATACAATAGAAGATTTCTGGTCTATGAAGCATTTTAGAGGAAATTAAA 
[C,T] 

GATGTTTATGTTAATTTTAAAAAGCAAGAGATAAAATTTCATATCAATATGACCTCAACT 
TTGTAAAATAAACATCATTTTTAAAAGAGATCAGAAGGAGCTATACCTCTGAGTGGTAAA
ATTATACATATTTTCCCCTGTCTTTATAACTTCCTATACCTTCCAGTTTTTTTATTATGA
GTAAACATTATTTTGATAATAAGACAGAATTAAAACAAAATAAAAACTTGTTTTAAATAA 
CATGGCATCTTGTTGAATAACTGCAGTATCTGCTCATGAAAGATTAGTTGATGAAAACAA

30477 TACAGACGTGTGCCACTATGCCCAGCTAATTTTTGTATTTTTGGTAGAGATGGGGTTTCA
CCATATTGGCCAGAATGGTCTCCATCTCTTGACCTCGTGATCCACCTGCCTGGGCCTCCC 
AAAGTGCTGGGATTACAGGTGTGAGCCATGGCGCCCGGCCCCGGCTAATTTTTATACTTT
TAGTAGAGACAGGGTTTCACCATGTTGGTCAGACTGGTCTCGAACTCCTGACCTTGCGAT 
CAGCCTGCCTCGGCCTCCCAAAGTGCTGGTATTACAAGCATAAGCCACTGCACCCAGCTG 
[T,G] 

TATATTCTTTTTCTTTAATTTTTTAATTAAAAAAAAAATTTTTGTGGGTACATAGTAAGT
GTATATATTTATGGGGTATATGAGATGTTTTGATACAGGCAAGCAATGTGAAATAAGCAC
ATCATGGAGAATAGGGTGTTTGTCCCCTCAAGTATTTATCCTTTGAGTTACAAACAACCC 
AGTTATACTCTGTAACTTATTTCAAAATGTACAATTAAGTTACTATTGACCATAGGCAGT 
CTATTGTGCTATCAAATAGTAGGTCTTATTCATTCTTTTGTTTTTTTAACCCATTAAGCT

30858 TGAGATGTTTTGATACAGGCAAGCAATGTGAAATAAGCACATCATGGAGAATAGGGTGTT

TGTCCCCTCAAGTATTTATCCTTTGAGTTACAAACAACCCAGTTATACTCTGTAACTTAT 
TTCAAAATGTACAATTAAGTTACTATTGACCATAGGCAGTCTATTGTGCTATCAAATAGT 
AGGTCTTATTCATTCTTTTGTTTTTTTAACCCATTAAGCTATGGTATATTCTGACAGACC
TATCTGCACATGTTCATGAGGTACAAGCTTATTGTTTGGAGTCCACAAATTTTGTACTTA 

[A,C] 

AATGAAGTATTCTGTACTGAGCATTATAATGGTATTTTGTTGGACAACTTCTAGTTTTTA
TATTTTATGAAACAATGCTGTATGCTCTTATAAGTATACTTTAGGCTTAATTTTCTTTTT
ATAACTGAAATTCTTCTAATTTCTAATAAATAAGATTTTTCTGTATAGGAAAAGTGAGTA
ACATAGCAACAGAAAACACTCTGCATTTAATATTCTTAATTCTAACATATTATGTATAGG 
ATTGAGAAGTTTTTATGATATAATAATTGATATTTCCCTAGTGATTCTTTGTGTTTAATT

31028 ATCAAATAGTAGGTCTTATTCATTCTTTTGTTTTTTTAACCCATTAAGCTATGGTATATT
CTGACAGACCTATCTGCACATGTTCATGAGGTACAAGCTTATTGTTTGGAGTCCACAAAT
TTTGTACTTAAAATGAAGTATTCTGTACTGAGCATTATAATGGTATTTTGTTGGACAACT 
TCTAGTTTTTATATTTTATGAAACAATGCTGTATGCTCTTATAAGTATACTTTAGGCTTA
ATTTTCTTTTTATAACTGAAATTCTTCTAATTTCTAATAAATAAGATTTTTCTGTATAGG
[A,G] 

AAAGTGAGTAACATAGCAACAGAAAACACTCTGCATTTAATATTCTTAATTCTAACATAT 

TATGTATAGGATTGAGAAGTTTTTATGATATAATAATTGATATTTCCCTAGTGATTCTTT 
GTGTTTAATTATTTGAATTCACTTCAGCAGAGTGTTGAATCTTTTAGGTCATACTAGTGA 
AATGCTTCTGGTATGTAAATGATAAAATGGCTACTGTCTTTTAATTAAAGAATTGTATTT
TTAAAGAAGGCTCATGGTTAAATTAAGAACCATTTGGAAGTGTATTTACTAAGTGTTTAC 

37308 TAGTAAAGACAGGGTTTCACCATGTTGGCCAACCTGATCTTGAACTCCTGACCTCAGGTG 
ATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCATGCCTGGC 
TCCATCTCCCAAGAAGGTCTCTAGAGTTGGCACAGATCACTGCTTCTTCAGAAGAGCTTC 
CATGTTAGTCCCTTCTTTCTATGTCAGCCCTATACCTGCTTGTTAGTTGGTTCTTCAAAT
TCTCAGGTACCCTCTCACCAGGCAGCCACTGACCTCATGTGATCCACCTGCCTTGGCCTC 
CTAAAGTGCTGGGACTACAGGCAAGAGCCACTATTCCCAGCCTTTC I I I LI M I I I 1 I I I 
[-,T,G]

TTAGAAAGATTTTGTTTTTATTTCCATCAGAATGTCATATATGTTACACAAATCAAATCT
GTTGACATCTCAAGCTTATAACAATTACGTGTTCTTATAAATTACGTGGGAATTACATGT 
ACTGTGAGAAGTGTTGTAATTATGATGTAATGTATATTATAATTTAGCCTACAGAAGTAA
CAAAGTCTTGTAATTAAATAAAGCAATAAATGTGTTGATAGATTATTACAATTGATAAGT
AATTGATAAATTATCTTCTTTTTCCTGTAACCCTTCTTCATCTCAAGTCTGATCTAGCTT

44213 GAGTGCAGTTGAACTGATTCTTAAAAATCTCTTAAAGGCGCCACATTGGAAATTCATCCT 

ACATGAAATAGCCAGCCTGTGTTTGGAAACCATGTTGTAAGAAAGACACATGGCTATTGA
AACACTAGGAACACACTCAGTGCCCTGGAATGCTCTCCTAGGAGAAGCTTGCAGGCACTG
AGACAGCTGTCTCCCATCCCACATGCACTTGGCCACACACTCATTGAGTAGAGCTACCAT 
[G,A] 

CTGCTGAAATTGATCTCTCTCTCTCTTTCTCCCACCGCAGTGCATACAGATAAATTCATA
TAAGTCAAATGAATGTATGGTGCAATTCAGTTGTGTTTGCCAGGCCATGAACTAGAGCTT
TCACATACTGTATTAGTCTGCTCTCATACTGCTAATAAAGACATACCCAAGACTGGGTAA 
TTTATAAAGAAAAAGAGGTGTAATAGACGCACAGTTTCACATGGCTGGGGAGGCCTCACA 
ATCATGGCAGAAGGCAAAGGAGGAGCAAAGTCATGCCTTACATGAAGGCAGGCAAGAGAG 

54046 GTTATTAAATCAATCATCAGATTTAACGCAGAAATCAACTCATGTAAACATACAGTGAGA 
GAATTGTATTTTTCTCTAAATTTTCAGGACATTGAAAAAGACTTGAAACAAATGAGGCTT 
CAGAACACAAAGGAAAGTAAAAATCCAGAACAGAAATATAAAGCTAAGGTAAGAAATACT 
TTTGTCTTTGGGTTCCATATTAAATAGCTGGCTGGGGAGCCACCTTGTGATCTCGGTTGC
CTGCATGATTTTCCCCCTAGTATTTTATAGAATTGCTCTATTTTGTGATATGAGACCAAT
[G,A] 

GTTTTAAGAATCTATAATGTCAAACAAAATTGACCTAGGGAGTTGTAATTTTAAGGCTTT 
TACTGAATTGCTAAACTTTTTTTTTTTTTTTGCTTTCTCCTAGAAGGGGGTAAAATTTGA
AATTAATTTAGACAAATGTATTTCTGATGAAAACATCCTCCAAGAGGAAGAGGTATGCCA 

TTAAGTCTAAATTTCCATTAGTAGGTATCAGAAAATGCATATATCTTAATAGCATGTTTC 
ATGAAATTATTTCACAGGCTGTAGGGATAATTTTTTTCAACTTTTATTTTAGATTCAGGT

56598 TCTTTTCTTATTTGGATGCCTAGAATTTTAGAAAATATTTCTAGAAAAATGTTTGGTGCT 
CAAGGCCAGGGAACGGTGGCTCACAGCTGTAATCCCAGCACTTTGGGAGGCTGAGACGGG
CAGATCATGAGATCAGGAGATTGAGACCATCCTGGCTAACATGGTGAAACCCCATCTCTA 
CTAAAAATACAAAAAATTAGCTGGGTGTGGTGTCACCCACCTGTAGTCTCAGCTACTTAG 
GAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCTGAGATC 
[A,G] 

CTGTACTGCACTCGAGCCTGGGCAACAGAGTGAGACACTGTCTCAAAAAAAAAAAAAGGG
AAAGAAAAATGTTTGGTGTTCAAATGAGTCCTCCAAATACTTTTTATTCTCCCATTTTAT
TTTATTGGTGTTATTTCTTTAGATAAATTATTACATTTTAATTTACTTTTCTTTAAATAA
AAGAGCTATTTTACTCATAATATTAATTTTTATCATAGCCAAATTAAAATAGAAGACCTG 
ATACATTGTCAACAACTAATATACTGACCTAAAAAATTGAACAGGTACCCTGAAACCAGG

56759 TGGTGAAACCCCATCTCTACTAAAAATACAAAAAATTAGCTGGGTGTGGTGTCACCCACC

TGTAGTCTCAGCTACTTAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAG 

GTTGCAGTGAGCTGAGATCGCTGTACTGCACTCGAGCCTGGGCAACAGAGTGAGACACTG 
TCTCAAAAAAAAAAAAAGGGAAAGAAAAATGTTTGGTGTTCAAATGAGTCCTCCAAATAC
TTTTTATTCTCCCATTTTATTTTATTGGTGTTATTTCTTTAGATAAATTATTACATTTTA

[A,T] 

TTTACTTTTCTTTAAATAAAAGAGCTATTTTACTCATAATATTAATTTTTATCATAGCCA
AATTAAAATAGAAGACCTGATACATTGTCAACAACTAATATACTGACCTAAAAAATTGAA 
CAGGTACCCTGAAACCAGGCACATTTATTTTAGGTCTTAATTAGTTATTGATAACTTTAA
GTAAATCTCATTTATGCATTTGGGCTCTCCTTGCCACAGCAAGGAGTAAATACAGTAAAT 
CCAATACAGTAAATCCAAATTTCATTTTATTAGTTGATTTCAAAATCTTTTTTTATCCTG



FIGURE 3BB 



58510 CTGAGATCCAGCCCTCCTCTCCTGCTCCATACCCACTCCCTCTTGCAGCTTTGGCTTCTC
CCAGGAGCTCCAGACTTACCAGTCTTTCTCATTGTCTTCTGGGAAGCTCCATGGACAAGT
GTTGCCAGTATCTGAAACTCAGCTGTGTAAAGTCAAGCTCTTCTGTGCTCTTCCCAGTGA
CCCTTTATTTTGGTTAGTGTCACAGATGCAACTGGCTGGGGCCAGTGTTGTGGGCAGTAA 
AAGAATTTATCAACACAATTGTAAGTAAAGAAAGGCAGATTTATTAAAGTACAGAGATAC
[A,G]

TTGCAAGAGTGCAATGGGCAGCACAGCAGAGAAGAGGCTGTCTGCTAAGAGGCAGGGGCT 
AGAGGGAAGTTTTATAGGGTCATATTGGAGGAGCTACATGCTGATAAGGTGTGCAGATAA 
GGTTTTGCTGCTTGGGCTACATGTGGAAGGAATGAGGTATTTGGGAACAGGATGTGACAG 
CAGCTTGTCTGTGATGAGTCATCTCTCAGAACAGTTGTTCCCCCATCCCCACCCCCAACC 
TGGGACCCCTCCCTTTTTGTTGTTTACTTATCTTATGAGAACTTCACAGTCAGTGCTGTC

60356 CCTGAAATGTTAATCACTGTTTGAAGTGCGAGCTGGGTGGAGAGTCAGGGAGGTCCGCAC

TCCTCCAGGGCTTCACATGCCATCATTTTTGTGATTGAGAAGGATCATGCTGGCTGCAGA
GCAAAGGATGGCATGGAGGGCAAGACTGAAGGCAGGAGAAGAGTCCAAGTGCATGAGCCA
GAGTGGTGCAGGGAGAATAGATACTGAGTGTGGGAACTGAGGAAGAGAAGGGGCTCAAGG 
ATAWCCCAGTTTTCTAATTCAAATGCATGAAGCTTTCATCAACCAAAAATACATCACAT 
[G,A] 

GAGGGTAATGGGGTCGGGAGAGACAAGGTAGTGATCTAAATTTGGAACATGTTGAGATTT 
AGGTCTATAGAGCATCAGTTGCAGATTCTATATAAGACTGAAGGCCTGGGGCATATCAGG
GATAAAGATATAGCTTGGTGGCCCTTAGCATATCCGTGGTTTTTAACTTTGGTGATGGTC

AAAATACCTATGCAGAAGGACTGGAGTGAGAAGGAAATGGAGCTTAGGACATAACCCTAC 
CACTATATAAACAAACTTTGGAGAATCAGGAGAGAGTAAAGCCAAAGGAGGAGAGACAGG 

61643 TCTACTGAAAATGGCATGTTTGTTTTTTTCTTTATAAGACTGATTTAAATCAAACCTTTG

CCTGTAATATGTATTGCAAATGTTTTCCTCAGTTGGTTGTCAGATCTCATTTATAGTAAT
AACAGCAAATATATATGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTGTGT 

ATTCATCCACTTAGGAATAAATTTTATGAGAATTGTGCGGCATATAGAAAGAAAACTGTA
AAACCTTACTGAGGTATTTACAGACCACTTGAATAAATGGAGAGAAATAACGGTGCTATA
[C,T]

ATTGGAAATATTTTTTCCAAATAAATATTGCAGTATCGTTGTCTGAGGTATTACCCAGAA
CTCTTTGTCTCACGACCAAAAGAATAAGGAGGGTGGACAGTAAGGGTGAGTTTGGACCGA 
AAATTTAATAAACAAAAGAGGAAAGCTCTTCACTGTGGAGAGGGGACCCAAGAGGGTTGC 
CATTTCACAGCTGAGTACAAAGGCTTTTATGAGGAACCTGATAGGGCTGGGGGTTTCATT 
TGCATAAGGCATGAATTTCTGGCAGCTCCACCCTGTTCTCCTAGTATGCTGACTGGCTAG 

62221 CTCCTAGTATGCTGACTGGCTAGGGGTTGTTnTGGAAAAGGCACCACTCAGAAAATGAC 
ATGATGGTTGACCAGGCATGGTAGTTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAG 
GTGGGCAGATCTCTCAAGGCCAGGAGTTCGAGACTAGCCTGGCCAATATGGCTAAAGCCC 
ATCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGTGCACACCTGTAATCTCAGCT 
ACTTGGGAGGCTGAGCCACAAGAATCACTTGGACCTGGGAGGTGAAGGTTGCAGTGAGAC 
[G,A] 

AGATTGTGCCACCACACTCCAGTCACACTCCAGCTGGGTGACAGAGCAAGCAAGACTCCA 
TCTCAAAAAAAAAAAAAAAAAAAATGACGTGGTGTAAAGACCAGTTGGAGCCTTGGCCCA 
CAACCAGCTGAGTGTTGGAGTGATGGTTCACAGAGGCTTGGCTCACAGTCCAAAGTATGC 
CCCAAAAAGGAAAGGAATGTGCTCACTGGGGCCCACCATGTACATGCCCACAAAAGGAGA 
AGGAACTATTTGCTAGAGGCCCACTGATTGCACAAAGAACAAAGGCATTTCTGTGTTGGA

62306 TCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCAGATCTCTCAAGGCCAGGA

GTTCGAGACTAGCCTGGCCAATATGGCTAAAGCCCATCTCTACTAAAAATACAAAAATTA 
GCCAGGTGTGGTGGTGCACACCTGTAATCTCAGCTACTTGGGAGGCTGAGCCACAAGAAT 
CACTTGGACCTGGGAGGTGAAGGTTGCAGTGAGACGAGATTGTGCCACCACACTCCAGTC 

GACGTGGTGTAAAGACCAGTTGGAGCCTTGGCCCACAACCAGCTGAGTGTTGGAGTGATG 
GTTCACAGAGGCTTGGCTCACAGTCCAAAGTATGCCCCAAAAAGGAAAGGAATGTGCTCA 
CTGGGGCCCACCATGTACATGCCCACAAAAGGAGAAGGAACTATTTGCTAGAGGCCCACT 
GATTGCACAAAGAACAAAGGCATTTCTGTGTTGGACTTTGCTCCCTTATCTGTGCAGCTG 
TGGGCATGTTTTAGGCAAGCTTCCTGTGCTAGTTCCCTTATCTGTGTCTGCAGCTTGATT 

63224 GAAAGTTAGAAACAGATCCTAGTATGTATCATAATTCAGCACAAATGAAAAGTAACATCA 
CAAGTCAGCGTGAAAAGAAAGGATTATTCAGATAAATGCTGCTGGGCCAATTGGTTAACA
GTTTGGGGAAGATTGTGAAATCAGACCCTATATAATATGATACAACAAAATAAATTTTTT 
AAAAAAGAGTTATATGTAAAAAGTTATACATTAGAAAATGAAATAAAAGAACATAGGTCA
TTTTTTTTTTTTTTTGAGACAGCGTCTCACTCTGTCACCAAGGCTGGAGTGCAAAGGCGT
[G,T] 

ATCTCGGCTCACTGCAAACTCCGCCTTCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCC 
CGAGTAGCTGGGACTACAGGCACCCGCTACCACGCCCAGCTAATTTTTATATTTTTGATA
GAGACGGGGTTTCACCATGTTGGCCAGGATGGTTTCGATCTCTTGACCTTGTGATCCGCC 
CGCCTCGGCCTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACTGCACCCGGCCGAGTTA
ATTTTTTTTGAACAGGGAAGAGCTATCTGTTCAAAATACATAGAAAAAAAAACCACAGAA

63350 GGAAGATTGTGAAATCAGACCCTATATAATATGATACAACAAAATAAATTTTTTAAAAAA
GAGTTATATGTAAAAAGTTATACATTAGAAAATGAAATAAAAGAACATAGGTCATTTTTT 
TTTTTTTTTGAGACAGCGTCTCACTCTGTCACCAAGGCTGGAGTGCAAAGGCGTGATCTC
GGCTCACTGCAAACTCCGCCTTCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGT
AGCTGGGACTACAGGCACCCGCTACCACGCCCAGCTAATTTTTATATTTTTGATAGAGAC
[G,A] 

GGGTTTCACCATGTTGGCCAGGATGGTTTCGATCTCTTGACCTTGTGATCCGCCCGCCTC 
GGCCTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACTGCACCCGGCCGAGTTAATTTTT
TTTGAACAGGGAAGAGCTATCTGTTCAAAATACATAGAAAAAAAAACCACAGAATAAATT
AGTAATAATTCAACTTTAACAACAAAAAGCTGTAATAAAGCAAATCATACTAACCCCT 

63517 AAGGCGTGATCTCGGCTCACTGCAAACTCCGCCTTCTGGGTTCAAGCGATTCTCCTGCCT 
CAGCCTCCCGAGTAGCTGGGACTACAGGCACCCGCTACCACGCCCAGCTAATTTTTATAT
TTTTGATAGAGACGGGGTTTCACCATGTTGGCCAGGATGGTTTCGATCTCTTGACCTTGT 
GATCCGCCCGCCTCGGCCTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACTGCACCCGG 
CCGAGTTAATTTTTTTTGAACAGGGAAGAGCTATCTGTTCAAAATACATAGAAAAAAAAA
[C,A] 

CACAGAATAAATTAGTAATAATTCAACTTTAACAACAAAAAGCTGTAATAAAGCAAATCA
TACTAACCCCT 